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'TITLE 

F£TTY ACID DESATURASE GENES FROM PLANTS 
FTFTiD OF THE INVENTION 

The invention relates to the preparation and use of 
5 nucleic acid fragments encoding fatty acid desaturase 
enzymes to modify plant lipid composition. 

BACKGROUND OF THE INVENTION 

Plant lipids have a variety of industrial and 
nutritional uses and are central to plant membrane 

10 function and climatic adaptation. These lipids 

represent a vast array of chemical structures, and these 
structures determine the physiological and industrial 
properties of the lipid. Many of these structures 
result either directly or indirectly from metabolic 

15 processes that alter the degree of unsaturation of the 
lipid. Different metabolic regimes in different plants 
produce these altered lipids, and either domestication 
of exotic plant species or modification of agronomically 
adapted species is usually required to economically 

20 produce large amounts of the desired lipid. 

Plant lipids find their major use as edible oils in 
the form of triacylglycerols . The specific performance 
and health attributes of edible oils are determined 
largely by their fatty acid composition. Most vegetable 

25 oils derived from commercial plant varieties are 

composed primarily of palmitic (16:0) , stearic (18:0), 
oleic (1-8:1), linoleic (18:2) and linolenic (18:3) 
acids. Palmitic and stearic acids are, respectively, 
16- and 18-carbon-long, saturated fatty acids . Oleic, 

30 linoleic, and linolenic acids are 16-carbon-long, 

unsaturated fatty acids containing one, two, and three 
double bonds, respectively. Oleic acid is referred to 
as a mono-unsaturated. fatty acid, while linoleic and 
linolenic acids are referred to as poly-unsaturated 

35 fatty acids. The relative amounts of saturated and 



WG 93/11245 



2 



PCT/US92/102S4 



unsaturated fatty acids., in commonly used, edible 
vegetable oils are summarized below (Table 1) : 

TABLE 1 

Percentages of Saturated and Unsaturated Fatty 
Arrids ih the Oils of Selected Oil Crops 





Saturated 


Mono<- 
un saturated 


Poly- 
unsaturated 


Canoia 


6% 


58% 


36% 


Soybean 


15% 


24% 


61% 


Corn 


13% 


25% 


62% 


Peanut 


18% 


48% 


34% 


Safflower 


9% 


13% 


78% 


Sunflower 


9% 


41% 


51% 


Cgtton 


30% 


19% 


51% 


Many 


recent research 


efforts have 


examined the role 



that saturated and unsaturated fatty acids play in 
5 reducing the risk of coronary heart disease. In the 

past, it was believed that mono-unsaturates, in contrast 
to saturates and poly-unsatiirates, had no effect on 
serum cholesterol and coronary heart disease risk. 
Several recent human clinical studies suggest that diets 

10 high in mono-unsaturated fat and low in saturated fat 
may reduce the "bad" (low-density lipoprotein) 
cholesterol while maintaining the "good" (high-density 
lipoprotein) cholesterol (Mattson et al., Journal of 
Lipid Research (1985) 26:194-202). 

15 A vegetable oil low in total saturates and high in 

mono-unsaturates would provide significant health 
benefits to consumers as well as economic benefits to 
oil processors. As an example, canola oil is considered 
a very healthy oil. However, in use, the high level of 

20 poly-unsaturated fatty acids in canola oil renders the 
oil unstable, easily oxidized, and susceptible to 
development of disagreeable odors and flavors 
(Gailliard, 1980, Vol. 4, pp. 85-116 in: Stumpf, P. K., 
Ed.., The Biochemistry of Plants, Academic Press, New 
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York) . The levels of poly-unsaturates may be reduced by 
hydrogenation, but the expense of this process and the 
concomitant production of nutritionally questionable 
trans isomers of the remaining unsaturated fatty acids 
5 reduces the overall desirability of the hydrogenated oil 
(Mensink et al., New England J. Medicine (1990) N323: 
439-445). Similar problems exist with soybean and corn 
oils . 

For specialized uses, high levels of poly- 

10 unsaturates can be desirable. Linoieate and linolenate 
are essential fatty acids in human diets, and an edible 
oil high in these fatty acids can be used for 
nutritional supplements^, for example in baby foods. 
Linseed oil, derived from the Flax plant (Linum 

15 usitatissimum) , contains over 50% linolenic acid and has 
widespread use in domestic and industrial coatings since 
the double bonds of the fatty acids .react rapidly with 
oxygen to polymerize into a soft and flexible film. 
Although the oil content of flax is comparable to canola 

20 (around 40% dry weight of seed), high yields are only 
obtained in warm temperatures or subtropical climates . 
In the USA flax is highly susceptible to rust infection. 
It will be commercially useful if a crop such as soybean 
or canola could be genetically transformed by the 

25 appropriate desaturase gene(s) to synthesize oils with a 
high linolenic acid content. 

Mutation-breeding programs have met with some 
success in altering the levels of poly-unsaturated fatty 
acid levels found in the edible oils of agronomic 

30 species. Examples of commercially grown varieties are 
high (85%) oleic sunflower and low (2%) linolenic flax 
(Knowles, (1980) pp. 35-38 In: Applewhite, T. H., Ed., 
World Conference on Biotechnology for the Fats and Oils 
Industry Proceedings, American Oil Chemists' Society) . 

35 Similar commercial progress with the other plants shown 
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in Table 1 has been largely elusive due to the difficult 
nature of the procedure and the pleiotropic effects of 
the mutational regime on plant hardiness and yield 
potential . 

5 The biosynthesis of the major plant lipids has been 

the focus of much research (Browse et al., Ann. Rev. 
Plant Physiol. Mol. Biol. (1991) 42:467-506). These 
studies show that, with the notable exception of the 
soluble stearoyl-acyl carrier protein desaturase, the 

10 controlling steps in the production of unsaturated fatty 
acids are largely catalyzed by membranes-associated fatty 
acid desaturases. Desaturat ion reactions occur in 
plastids and in the endoplasmic reticulum using a 
variety of substrates including galactolipids, 

15 sulf ©lipids, and phospholipids. Genetic and 

physiological analyses of Arabidopsis thal i ana nuclear 
mutants defective in various fatty acid desaturation 
reactions indicates that most of these reactions are 
catalyzed by enzymes encoded at single genetic loci in 

20 the plant. The analyses show further that the different 
defects in fatty acid desaturation can have profound and 
different effects on the ultra-structural morphology, 
cold sensitivity, and photosynthetic capacity of the 
plants (Ohlrogge, et al., Biochim. Biophys. Acta (1991) 
^25 1082:1-26). However, biochemical characterization of 
the desaturase reactions has been meager . The 
instability of the enzymes and the intractability of 
their proper assay has largely limited researchers to 
investigations of enzyme activities in crude membrane 

30 preparations. These investigations have, however, 
demonstrated the role of-delta-12 desaturase and 
delta-15 desaturase activities in the production of 
linoleate and linolenate from 2-oleoyl-phosphatidyl- 
choline and 2-linoleoyl-phdsphatidylcholine, 

35 respectively (Wang et al., Plant Physiol. Biochem. 
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(1988) 26:777-792). Thus, modification of the 
activities of these enzymes represents an attractive 
target for altering the levels of lipid unsaturation by 
genetic engineering. 
5 Genes from plants for stearoyl-acyl carrier protein 

desaturase, the only soluble fatty acid desaturase 
known, have been described (Thompson, et al., Proc. 
Natl. Acad. Sci. U.S.A. (1991) 88:2578-2582; Shanklin et 
al., Proc. Natl. Acad. Sci. USA (1991) 88:2510-2514). 

10 Stearoylr-cbenzyme-A desaturase genes from yeast, rat, 
and mice have also been described (Stukey, et al., J. 
Biol. Chem. (1990) 265:20144-20149; Thiede, et al., J. 
Biol. Chem. (1986) 261:13230-13235; Kaestner, et al., J. 
Biol. Chem. (1989) 264:14755-1476). No evidence exists 

15 in the public art that describes the isolation of fatty 
acid desaturases other than stearoyl-ACP desaturases 
from higher plants or their corresponding genes. A 
fatty acid desaturase gene from the cyanobacterium, 
Syngrhorvfitis PCC 6803, has also been described (Wada, 

20 et al., Nature (1990) 347:200-203). This gene encodes a 
fatty acid desaturase, designated das. that catalyzes 
the conversion of oleic acid at the 1 position of 
galactolipids to linoleic acid. However, these genes 
have; not proven useful for isolating plant fatty acid 

25 desaturases other than stearoyl-ACP desaturase via 

sequence-dependent protocols, and the present art does 
not indicate how to obtain plant fatty acid desaturases 
other than stearoyl-ACP desaturases or how to obtain 
fatty acid desaturase-related enzymes. Thus, the 

30 present art does not teach how to obtain glycerolipid 
desaturases from plants.* Furthermore, there is no 
evidence that a method to control the nature and levels 
of unsaturated fatty acids in plants using nucleic acids 
encoding fatty acid desaturases other than stearoyl-ACP 

35 desaturase is known in the art . 
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The biosynthesis of the minor plant lipids has been 
less well studied. While hundreds of different fatty 
acids have been found, many from the plant kingdom, only 
a tiny fraction of all plants have been surveyed for 
5 their lipid content (Gunstone, et al., Eds., (1986) The 
Lipids Handbook, Chapman and Hall Ltd., Cambridge). 
Accordingly, little is known about the biosynthesis of 
these unusual fatty acids and fatty acid derivatives. 
Interesting chemical features found in such fatty acids 

10 include, for example, allenic and conjugated double 
bonds, acetylenic bonds, trans double bonds, multiple 
double bonds, and single double bonds in a wide number 
of positions and configurations along the fatty acid 
chain. Similarly, many of the structural modifications 

15 found in unusual lipids (e.g., hydroxylation, 

epoxidatiori, cyclization, etc.) are probably produced 
via further metabolism following chemical activation of 
the fatty acid by desaturation or they involve a 
chemical reaction that is mechanistically similar to 

20 desaturation. For example, evidence for the mechanism 
of hydroxylation of fatty acids being part of a general 
mechanism of enzyme-catalyzed desaturation in eukaryotes 
has been obtained by substituting a sulfur atom in the 
place of carbon at the delta-9 position of stearic acid. 

25 When incubated with yeast cell extracts the thiostearate 
was converted to a 9-sulf oxide (Buist et al. (1987) 
Tetrahedron Letters 28:857-860). This sulfoxidation was 
specific for sulfur at the delta-9 position and did not 
occur in a yeast delta- 9-desaturase deficient mutant 

30 (Buist & Marecak (1991) Tetrahedron Letters 32:891-894). 
The 9-sulfoxide is the sulfur analogue of 9-hydroxyocta- 
decastearate, the proposed intermediate of stearate 
desaturation. Thus fatty-acid desaturase cDNAs may 
serve as useful probes for cDNAs encoding fatty-acid 

35 hydroxylases and other cDNAs which encode enzymes with 



WO 93/1 1245 



7 



PCT/US92/ 10284 



reaction mechanisms similar to fatty-acid desaturation . 
Many of these fatty acids and derivatives having such 
features within their structure could prove commercially 
useful if an agronomically viable species could be 
5 induced to synthesize them by introduction of a gene 
encoding the appropriate desaturase. 

SUMMARY OF THE INVENTION 

Applicants have discovered a means to control the 
nature and levels of unsaturated fatty acids in plants. 

10 Nucleic acid fragments from glycerolipid desaturase 

cDNAs or genes are used to create chimeric genes. The 
chimeric genes may be used to transform various plants 
to modify the fatty acid composition of the plant or the 
oil produced by the plant. More specifically, one 

15 embodiment of the invention is an isolated nucleic acid 
fragment comprising a nucleotide sequence encoding a 
plant delta- 15 fatty acid desaturase or a fatty acid 
desaturase-related enzyme with an amino acid identity of 
50% , 65%, 90% or greater to the polypeptide encoded by 

20 SEQ ID NOS:i, 4, 6, 8, 10, 12, 14, or 16. The isolated 
fragment in these embodiments is isolated f roma plant 
selected from the group consisting of soybean, oilseed 
Brassica species, Arabidbpsis thaliana and corn. 

Another embodiment of this invention involves the 

25 use of these nucleic acid fragments in sequence- 
dependent protocols* Examples include use of the 
fragments as hybridization probes to isolate other 
glycerolipid desaturase cDNAs or genes. A related 
embodiment involves using the disclosed sequences for 

30 amplification of DNA fragments encoding other glycero- 
lipid desaturases. 

Another aspect of this invention involves chimeric 
genes, capable of causing altered levels of the linolenic 
acid in a transformed plant cell, the gene comprising 

35 nucleic acid fragments encoding encoding a plant 
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delta-15 fatty acid desaturase or a fatty acid 
desaturase-related enzyme with an amino acid identity of 
50%, 65%/ 90% or greater to the polypeptide encoded by 
SEQ ID NOS:l, 4, 6, 8, 10, 12, 14, or 16 operably linked 
5 in suitable orientation to suitable regulatory 

sequences. Preferred are those chimeric genes which 
incorporate nucleic acid fragments encoding delta-15 
fatty acid desaturase cDNAs or genes. Plants and oil 
from seeds of plants containing the chimeric genes 

10 described are also claimed. 

Yet another embodiment of the invention involves a 
method of producing seed oil containing altered levels 
of linolenic (18:3) acid comprising: (a) transforming a 
plant cell with a chimeric gene described above; (b) 

15 growing fertile plants from the transformed plant cells 
of step (a) ; (c) screening progeny seeds from the 
fertile plants of step (b) for the desired levels of 
linolenic (18:3) acid, and (d) processing the progeny 
seed of step (c) to obtain seed oil containing altered 

20 levels of the unsaturated fatty acids. Preferred plant 
cells and oils are derived from soybean, rapeseed, 
sunflower, cotton, cocoa, peanut, saf flower, coconut, 
flax, oil palm, and corn. Preferred methods of 
transforming such plant cells would include the use of 

25 Ti and Ri plasmids of J\^r^rrt^ium. elect roporat ion, 
and high-velocity ballistic bombardment. 

The invention also is embodied in a method of 
breeding plant species to obtain altered levels of poly- 
unsaturated fatty acids, specifically linolenic (18:3) 

30 acid in seed oil of oil-producing plants. This method 
involves (a) making a cross between two varieties of an 
oilseed plant differing in the linolenic acid trait; (b) 
making a Southern blot of restriction enzyme digested 
genomic DNA isolated froi^ several progeny plants 

35 resulting from the cross of step (a); and (c) 
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hybridizing the Southern blot with the radiolabeled 
nucleic acid fragments encoding the claimed glycerolipid 
desaturases. 

The invention is also embodied in a method of RFLP 
5 mapping that uses the isolated Arabidopsis thaiiana 
delta-15 desaturase sequences described herein. 

The invention is also embodied in plants capable of 
producing altered levels of glycerolipid desaturase by 
virtue of containing the chimeric genes described 

10 herein. Further, the invention is embodied by seed oil 
obtained from such plants. 

The invention is also embodied in a method of RFLP 
mapping ina genomic KFLP marker comprising (a) making a 
cross between two varieties of plants; (b) making a 

15 Southern blot of restriction enzyme digested genomic DNA 
isolated from several progeny plants resulting from the 
cross of step (a) ; and (c) hybridizing the Southern blot 
with a radiolabelled nucleic acid fragments of the 
claimed fragments. 

20 The invention is also embodied in a method to 

isolate nucleic acid fragments encoding fatty acid 
desaturases and fatty acid desaturase-related enzymes, 
comprising (a) comparing SEQ ID NOS:2, 5, 7, 9, 11, 13, 
15 and 17 with other fatty acid desaturase polypeptide 

25 sequences; (b) identifying the conserved sequence (s) of 
4 or more amino acids obtained in step a; (c) making 
region-specific nucleotide probe is) or oligomer (s) based 
on the conserved sequences identified in step b; and d) 
using the nucleotide probe (s) or oligomers (s) of step c 

30 to isolate sequences encoding fatty acid desaturases and 
fatty-acid desaturase-related enzymes by sequence- 
dependent protocols. The product of the method of 
isolation method described is also part of the 
invention. 
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nESPRTPTThN ^ sgonrNrF DESCRIPTIONS 
The invention can be more fully understood from the 
following detailed description and the Sequence 
Descriptions which form a part of this application. The 
5 Sequence Descriptions contain the one letter code for 
nucleotide sequence characters and the three letter code 
for amino acids in conformity with the IUPAC-IUB 
standard described in Nucleic Acids Research 
13:3021-3030 (19085) and 37 C.F.R. 1.822 which are 

10 incorporated herein by reference. 

SEQ ID NO:l shows the complete 5* to 3' nucleotide 
sequence of 1350 base pairs of the ftrflbidopsis cDNA 
which encodes delta-15 desaturase in plasmid pCF3. 
Nucleotides 46 to 48 are the putative initiation codon 

15 of the open reading frame (nucleotides 46 to 1206) - 
Nucleotides 1204 to 1206 are the termination codon. 
Nucleotides 1 to 45 and 1207 to 1350 are the 5' and 3' 
untranslated nucleotides, respectively. The 386 amino 
acid protein sequence in SEQ ID NO:l is that deduced 

20 from the open reading frame. 

SEQ ID NO: 2 is the deduced peptide of the open- 
reading frame of SEQ ID NO:l. 

SEQ ID NO: 3 is a partial nucleotide sequence of the 
RT*hiriopsis genomic DNA insert in plasmid pFl which 

25 shows the genomic sequence in the region of the 

Arahiriopsis genome that encodes delta-15 desaturase. 
Nucleotides 68-255 are identical to nucleotides 1-188 of 
SEQ ID NO:l. Nucleotides 47 to 49 and 56 to 58 are 
termination codons in the same reading frame as the open 

30 rfeading frame in SEQ ID N0:1. 

SEQ ID NO: 4 shows the 5' to 3' nucleotide sequence 
of the insert in plasmid pACF2-2 of 1525 base pairs of 
the Arahidopsis i-h*liana cDNA that encodes a plastid 
delta-15 fatty acid desaturase. Nucleotides 10-12 and 

35 nucleotides 1348 to 1350 are, respectively, the putative 
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initiation codon and the termination codon cf the open 
reading frame (nucleotides 10 to 1350) . Nucleotides 1 to 
9 and 1351 to 1525 are, respectively, the 5' and 3' 
untranslated nucleotides. 
5 SEQ ID NO: 5 is the deduced peptide of the open 

reading frame of SEQ ID NO: 4 . 

SEQ ID NO: 6 shows the complete 5' to 3* nucleotide 
sequence of 1336 base pairs of the Bras si ca napus. seed 
cDNA, found in plasmid pBNSF3-2, which encodes a 

10 microsomal delta- 15 glycerolipid desaturase . 

Nucleotides 79 to 82 are the putative initiation codon 
of the open reading frame (nucleotides 79 to 1212) . 
Nucleotides 1210 to 1212 are the termination codon* 
Nucleotides 1 to 78 and 1213 to 1336 are the 5' and 3* 

15 unstranslated nucleotides respectively. 

SEQ ID NO: 7 is the deduced peptide of the open 
reading frame of SEQ ID NO: 6. 

SEQ ID NO: 8 is the complete 5' to 3' nucleotide 
sequence of 1416 base pairs of the Brassica napus. seed 

20 cDNA found in plasmid pBNSFd-2 which encodes a plastid 
delta-15 glycerolipid desaturase. Nucleotides 1 to 1215 
correspond to a continuous open reading frame of 404 
amino acids. Nucleotides 1213 to 1215 are the 
termination codon. Nucleotides 1215 to 1416 are the 3 r 

25 untranslated nucleotides. 

SEQ ID NO: 9 is the deduced peptide of the open 
reading frame of SEQ ID NO: 8. 

SEQ ID NO: 10 is the complete nucleotide sequence of 
the soybean ( glycine max ) microsomal delta-15 desaturase 

30 cDNA, found in plasmid pXFl, which the 2184 nucleotides 
of this sequence contain -both the coding sequence and 
the 5* and 3* non-translated regions of the cDNA. 
Nucleotides 855 to 857 are the putative initiation codon 
of the open reading frame (nucleotides 855 to 2000) . 
35 Nucleotides 1995 to 1997 are the termination codon. 
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Nucleotides 1 to 854 and 1998 to 2184 are the 5' and 3' 
mistranslated nucleotides respectively. The 380 amino 
acid protein sequence in SEQ ID NO: 7 is that deduced 
from the open reading frame. 
5 SEQ ID NO: 11 is the deduced peptide of the open 

reading frame in SEQ ID NO: 10. 

SEQ ID NO: 12 is the complete 5' to 3' nucleotide 
sequence of 1676 base pairs of the soybean (Glycine max) 
seed cDNA found in plasmid pSFD-118bwp which encodes a 

10 soybean plastid delta-15 desaturase. Nucleotides 169 to 
1530 correspond to a continuous open reading frame of 
453 amino acids. Nucleotides 169 to 171 are the 
putative initiation codon of the open reading frame. 
Nucleotides 1528 to 1530 are the termination codon. 

15 Nucleotides 1531 to 1676 are the 3* untranslated 

nucleotides. Nucleotides 169 to 382 encode the putative 
plastid transit peptide, based on comparison of the 
deduced peptide with the soybean microsomal delta-15 
peptide. 

20 SEQ ID NO: 13 is the deduced peptide of the open 

reading frame in SEQ ID NO: 12. 

SEQ ID NO: 14 is the complete nucleotide sequence of 
a 396 bp polymerase chain reaction product derived from 
corn seed mRNA that is found in the insert of plasmid 

25 pPCR20. Nucleotides 1 to 31 and 364 to 396 correspond 
to the amplification primers described in SEQ ID NO: 18 
and SEQ ID NO: 19, respectively. Nucleotides 31 to 363 
encode an internal region of a corn seed delta-15 
desaturase that is 61.9% identical to the region between 

30 amino acids 137 and 249 of the Brassica napus delta-15 
desaturase peptide sequence shown in SEQ ID NO: 7. 

SEQ ID NO: 15 is the deduced amino acid sequence of 

SEQ ID NO: 14. 

SEQ ID NO: 16 shows the partial composite 5 1 to 3' 
35 nucleotide sequence of 472 bp derived from the inserts 
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in plasmids pFadx-2 and pYacp7 for Arshidopsi s thaliana 
cDNA that encodes a plastid delta-15 fatty acid 
desaturase. Nucleotides 2-4 and nucleotides 4 68 to 470 
are, respectively, the first and the last codons in the 
5 open reading frame. 

SEQ ID NO: 17 is deduced partial peptide sequence of 
the open reading frame in SEQ ID NO: 16. 

SEQ ID NO: 18 One hundred and twenty eight fold 
degenerate sense 31-mer PGR primer. Nucleotides 1 to 8 
10 correspond to the Bam HI restriction enzyme recognition 
sequence. Nucleotides 9 to 137 correspond to amino acid 
residues 130 to 137 of SEQ ID NO: 6 with a deoxyinosine 
base at nucleotide 11. 

SEQ ID NO: 19 Two thousand and forty eight-fold 
15 degenerate antisense 35-mer PCR primer. Nucleotides 1 
to 8 correspond to the Bam HI restriction enzyme 
recognition sequence. Nucleotides 9 to 35 correspond to 
amino acid residues 249 to 256 of SEQ ID NO: 6 with a 
deoxyinosine base at nucleotide 15. 
20 SEQ ID NO:20 Sixteen-fold degenerate sense 36-^mers 

made to amino acid residues 97-108 in SEQ ID NO: 2. 

SEQ ID NO: 21 Sixteen-fold degenerate sense 36-mers 
made to amino acid residues 97-108 in SEQ ID NO:2. 
SEQ ID NO: 22 Seventy two- fold degenerate sense 
25 18-mers made to amino acid residues 100-105 in SEQ ID 
NO: 2. 

SEQ ID NO: 23 Seventy two-fold degenerate sense 
18-mers made to amino acid residues 100-105 in SEQ ID 
NO: 2. 

30 SEQ ID NO: 24 Seventy two-fold degenerate antisense 

18-mers made to amino acid residues 299-304 in SEQ ID 
NO:2. 

SEQ ID NO: 25 Seventy two-fold degenerate antisense 
18-mers made to amino acid residues 299-304 in SEQ ID 
35 NO:2. 
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SEQ ID NO: 26 Seventy two-fold degenerate antisense 
18-mers made to amino acid residues 304-309 in SEQ ID 
NO:2. 

SEQ ID NO: 27 Seventy two-fold degenerate antisense 
5 18-itiers made to amino acid residues 304-309 in SEQ ID 
NO:2. 

SEQ ID NO: 28 Sixteen-fold degenerate sense 36-mers 
made to amino acid residues 97-108 in SEQ ID NO:2. 

SEQ ID NO: 29 Sixteen-fold degenerate sense 36-mers 
10 made to amino acid residues 97-108 in SEQ ID NO: 2. 

SEQ ID NO: 30 Sixty four-fold degenerate antisense 
38-mers made to amino acid residues 299-311 in SEQ ID 
NO:2. 

SEQ ID NO: 31 Sixty four-fold degenerate antisense 
15 38-mers made to amino acid residues 299-311 in SEQ ID 
NO: 2. 

SEQ ID NO: 32 A 135-mer made as an antisense strand 
to amino acid residues 97-141 in SEQ ID NO:2. 

DETAILED DESCR TPTTQN OF THE INVENTION 

20 Applicants have isolated nucleic acid fragments 

that encode plant fatty acid desaturases and that are 
useful in modifying fatty acid composition in oil- 
producing species by transformation. 

Thus, transfer of the nucleic acid fragments of the 

25 invention or a part thereof that encodes a functional 
enzyme, along with suitable regulatory sequences that 
direct the transciption of their raENA, into a living 
cell will result in the production or over-production of 
plant fatty acid desaturases and will result in 

30 increased levels of unsaturated fatty acids in cellular 
lipids, including triacylglycerols . 

Transfer of the nucleic acid fragments of the 
invention or a part thereof, along with suitable 
regulatory sequences that direct the transciption of 

35 their antisense RNA, into plants will result in the 
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inhibition of expression of che endogenous fatty acid 
desaturase that is substantially homologous with the 
transferred nucleic acid fragment and will result in 
decreased levels of unsaturated fatty acids in cellular 
5 „ lipids, including triacylglycerols . 

Transfer of the nucleic acid fragments of the 
invention or a part thereof, along with suitable 
regulatory sequences that direct the transciption of 
their mRNA, into plants may result in inhibition by 
10 cosuppression of the expression of the endogenous fatty 
acid desaturase gene that is substantially homologous 
with the transferred nucleic acid fragment and may 
result in decreased levels of unsaturated fatty acids in 
cellular lipids, including triacylglycerols. 
15 The nucleic acid fragments of the invention can 

also be used as restriction fragment length polymorphism 
(RFLP) markers in Arabidopsis genetic mapping and plant 
breeding programs. 

The nucleic acid fragments of the invention or 
20 oligomers derived therefrom can also be used to isolate 
other related glycerolipid desaturase genes using DNA, 
RNA, or a library of cloned nucleotide sequences from 
the same or different species by well known sequence- 
dependent protocols, including, for example, methods of 
25 nucleic acid hybridization and amplification by the 
polymerase chain reaction. 

Definitions 

In the context of this disclosure, a number of 
terms shall be used. The term "fatty acid desaturase" 

30 used herein refers to an enzyme which catalyzes the 

breakage of a carbon-hydrogen bond and the introduction 
of a carbon-^carbon double bond into a fatty acid 
molecule. The fatty acid may. be free or esterified to 
another molecule including, but not limited to, acyl- 

35 carrier protein, coenzyme A, sterols and the glycerol 
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moiety of glycerolipids , The term "glyeerolipid 
desaturases" used herein refers to a subset of the fatty 
acid desaturases that act on fatty acyl moieties 
esterified to a glycerol backbone. "Delta-12 
5 desaturase" refers to a fatty acid desaturase that 

catalyzes the formation of a double bond between carbon 
positions 6 and 7 (numbered from the methyl end), (i.e., 
those that correspond to carbon positions 12 and 13 
(numbered from the carbonyl carbon) of an 18 carbon-long 
10 fatty acyl chain or carbon positions 10 and 11 (numbered 
from the carbonyl carbon) of a 16 carbon- long fatty acyl 
chain) . "Delta-15 desaturase" refers to a fatty acid 
desaturase that catalyzes the formation of a double bond 
between carbon positions 3 and 4 (numbered from the 
15 methyl end), (i.e., those that correspond to carbon 

positions 15 and 16 (numbered from the carbonyl carbon) 
of an 18 carbon-long fatty acyl chain and carbon 
positions 13 and 14 (numbered from the carbonyl carbon) 
of a 16 carbon-long fatty acyl chain) . Examples of 
20 fatty acid desaturases include, but are not limited to, 
the microsomal delta-12 and deita-15 desaturases that 
act on phosphatidylcholine lipid substrates; the 
chloroplastic delta-12 and delta- 15 desaturases that act 
on phosphatidyl glycerol and galactoiipids; and other 
25 desaturases that act on such fatty acid substrates such 
as phospholipids, galactoiipids, and sulfolipids. 
"Microsomal desaturase" refers to the cytoplasmic 
location of the enzyme, while "chloroplast desaturase" 
and "plastid desaturase" refer to the plastid location 
30 of the enzyme. These fatty acid desaturases may be 
found in a variety of organisms including, but not 
limited to, higher plants, diatoms, and various 
eukaryotic and prokaryotic microorganisms such as fungi 
and photosynthetic bacteria and algae. The term 
35 "homologous fatty acid desaturases" refers to fatty acid 
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desaturases that catalyze the same desaturation on the 
same lipid substrate. Thus, microsomal delca-15 
desaturases, even from different plant species, are 
homologous fatty acid desaturases. The term 
5 "heterologous fatty acid desaturases" refers to fatty 
acid desaturases that catalyze desaturations at 
different positions and/or on different lipid 
substrates. Thus, for example, microsomal delta-12 and 
delta-15 desaturases, which act on phosphatidylcholine 

10 lipids, are heterologous fatty acid desaturases, even 
when from the same plant. Similarly, microsomal 
delta-15 desaturase, which acts on phosphatidylcholine 
lipids, and chloroplast delta-15 desaturase/ which acts 
on galactolipids, are heterologous fatty acid 

15 desaturases, even when from the same plant, it should 
be noted that these fatty acid desaturases have never 
been isolated and characterized as proteins. 
Accordingly the terms such as "delta-12 desaturase" and 
"delta-15 desaturase" are used as a convenience to 

20 describe the proteins encoded by nucleic acid fragments 
that have been isolated based on the phenotypic effects 
caused by their disruption. The term "fatty acid 
desaturase-related enzyme" refers to enzymes whose 
catalytic product may not be a carbon-carbon double bond 

25 but whose mechanism of action is similar to that of a 
fatty acid desaturase (that is, catalysis of the 
displacement of a carbon-hydrogen bond of a fatty acid 
chain to form a fatty-hydroxygcyl intermediate or end- 
product) • This term is different from "related fatty 

30 acid desaturases", which refers to structural 
similarities between fatty acid desaturases. 

The term "nucleic acid" refers to a large molecule 
which can be single-stranded or double-stranded, 
composed of monomers (nucleotides) containing a sugar, a 
35 phosphate and either a purine or pyrimidine. A "nucleic 
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acid fragment" is a fraction of a given nucleic acid 
molecule. In higher plants, deoxyribonucleic acid (DNA) 
is the genetic material while ribonucleic acid (RNA) is 
involved in the transfer of the information in DNA into 
5 proteins. A "genome" is the entire body of genetic 
material contained in each cell of an organism. The 
term "nucleotide sequence" refers to the sequence of DNA 
or RNA polymers, which can be single- or double- 
stranded, optionally containing synthetic, non-natural 

10 or altered nucleotide bases capable of incorporation 

into DNA or RNA polymers. The term "oligomer* refers to 
short nucleotide sequences, usually up to 150 bases 
long. "Region-specific nucleotide probes" refers to 
isolated nucleic acid fragments derived from a cDNA or 

15 gene using a knowledge of the amino acid regions 

conserved between different fatty-acid desaturases which 
may be used to isolate cDNAS or genes for other fatty- 
acid desaturases or fatty acid desaturase-related 
enzymes using sequence dependent protocols . As used 

20 herein, the term "homologous to" refers to the 

rela:tedness between the nucleotide sequence of two 
nucleic acid molecules or between the amino acid 
sequences of two protein molecules. Estimates of such 
homology are provided by either DNA-DNA or DNA-RNA 

25 hybridization under conditions of stringency as is well 
understood by those skilled in the art (Hames and 
Higgins, Eds. (1985) Nucleic Acid Hybridisation, im- 
press, Oxford, O.K.); or by the comparison of sequence 
similarity between two nucleic acids or proteins, such 

30 as by the method of Needleman et al. (J. Mol. Biol. 
(1970) 48:443-453). As used herein, "substantially 
homologous" refers to nucleotide sequences that have 
more than 90% overall identity at the nucleotide level 
with the coding region of the claimed sequence, such as 

35 genes and pseudo-genes corresponding to the coding 
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regions. The nucleic acid fragments described herein 
include molecules which comprise possible variations, 
both man-made and natural, such as but not limited to 
(a) those that involve base changes that do not cause a 
5 change in ah encoded amino acid, or (b) which involve 
base changes that alter an amino acid but do not affect 
the functional properties of the protein encoded by the 
DNA sequence, (c) those derived from deletions, 
rearrangements, amplifications, random or controlled 

10 mutagenesis of the nucleic acid fragment, and (d) even 
occasional nucleotide sequencing errors . 

"Gene" refers to a nucleic acid fragment that 
expresses a specific protein, including regulatory 
sequences preceding (5* non-coding) and following (3' 

15 non-coding) the coding region. "Fatty acid desaturase 
gene" refers to a nucleic acid fragment that expresses a 
protein with fatty acid desaturase activity. "Native" 
gene refers to an isolated gene with its own regulatory 
sequences as found in nature. "Chimeric gene" refers to 

20 a gene that comprises heterogeneous regulatory and 

coding sequences not found in nature. "Endogenous" gene 
refers to the native gene normally found in its natural 
location in the genome and is not isolated. A "foreign" 
gene refers to a gene not normally found in the host 

25 organism but that is instead introduced by gene 

transfer. "Pseudo-gene" refers to a genomic nucleotide 
sequence that does not encode a functional enzyme. 

"Coding sequence" refers to a DNA sequence that 
codes for a specific protein and excludes the non-coding 

30 sequences. It may constitute an "uninterrupted coding 
sequence", i.e., lacking *an intron or it may include one 
or more introns bounded by appropriate splice junctions. 
An "intron" is a nucleotide sequence that is transcribed 
in the primary transcript but that is removed through 

35 cleavage and re-ligation of the RNA within the cell to 
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create the mature mRNA that can be translated into a 
protein . 

"Initiation codon" and "termination codon" refer to 
a unit of three adjacent nucleotides in a coding 
5 sequence that specifies initiation and chain termination 
respectively, of protein synthesis (mRNA translation) . 
"Open reading frame" refers to the coding sequence 
uninterrupted by introns between initiation and 
termination codons that encodes an amino acid sequence. 

10 "RNA transcript" refers to the product resulting 

from RNA polymerase-catalyzed transcription of a DNA 
sequence. When the RNA transcript is a perfect 
complementary copy of the DNA sequence, it is referred 
to as the primary transcript or it may be a RNA sequence 

15 derived from posttranscriptional processing of the 

primary transcript and is referred to as the mature RNA. 
"Messenger RNA (mRNA) " refers to the RNA that is without 
introns and that can be translated into protein by the^ 
cell. "cDNA" refers to a double-stranded DNA that is 

20 complementary to and derived from mRNA. "Sense" RNA 
refers to RNA transcript that includes the mRNA. 
"Antisense RNA" refers to a RNA transcript that is 
complementary to all or part of a target primary 
transcript or mRNA and that blocks the expression of a 

25 target gene by interfering : with the processing, 

transport and/pr translation of its primary transcript 
or mRNA. The complementarity of an antisense RNA may be 
with any part of the specific gene transcript, i.e., at 
the 5 » non-coding sequence, 3 r non-coding sequence, 

30 introns, or the coding sequence. In addition, as used 
herein, antisense RNA may contain regions of ribozyme 
sequences that increase the efficacy of antisense RNA to 
block gene expression. "Ribozyme" refers to a catalytic 
RNA and includes sequence-specific endoribonucleases . 
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As used herein, "suitable regulatory sequences" 
refer to nucleotide sequences in native or chimeric 
genes that are located upstream (5*)/ within, and/or 
downstream (3 f ) to the nucleic acid fragments of the 
5 invention, which control the expression of the nucleic 
acid fragments of the invention. The term "expression" f 
as used herein, refers to the transcription and stable 
accumulation of the sense (mRNA) or the antisense RNA 
derived from the nucleic acid fragment (s) of the 

10 invention that, in conjunction with the protein 

apparatus of the cell, results in altered levels of the 
fatty acid desaturase (s) . Expression or overexpression 
of the gene involves transcription of the gene and 
translation of the mRNA into precursor or mature fatty 

15 acid desaturase proteins. "Antisense inhibition" refers 
to the production of antisense RNA transcripts capable 
of preventing the expression of the target protein. 
"Overexpression" refers to the production of a gene 
product in transgenic organisms that exceeds levels of 

20 production in normal or non-transformed organisms. 

"Cosuppression" refers to the expression of a foreign 
gene which has substantial homology to an endogenous 
gene resulting in the suppression of expression of both 
the foreign and the endogenous, gene. "Altered levels" 

25 refers to the production of gene product (s) in 

transgenic organisms in amounts or proportions that 
differ from that of normal or non-transformed organisms. 

"Promoter" refers to a DNA sequence in a gene, 
usually upstream (5 1 ) to its coding sequence, which 

30 controls the expression of the coding sequence by 

providing the recognition for RNA polymerase and other 

* factors required for proper transcription. In 
artificial DNA constructs promoters can also be used to 

* transcribe antisense RNA. Promoters may also contain 
35 DNA sequences that are involved in the binding of 
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protein factors which control the effectiveness of 
transcription initiation in response to physiological or 
developmental conditions. It may also contain enhancer 
elements- An "enhancer" is a DNA sequence which can 
5 stimulate promoter activity. It may be an innate 
element of the promoter or a heterologous element 
inserted to enhance the level and/or tissue-specif icity 
of a promoter. "Constitutive promoters " refers to those 
that direct gene expression in all tissues and at all 
10 times- "Tissue-specific" or "development-specific" 
promoters as referred to herein are those that direct 
gene expression almost exclusively in specific tissues , 
such as leaves or seeds, or at specific development 
stages in a tissue, such as in early or late embryo- 
15 genesis, respectively. 

The M 3 r non^coding sequences" refers to the DNA 
sequence portion of a gene that contains a poly- 
adenylation signal and any other regulatory signal 
capable of affecting mRNA processing or gene expression. 
20 The polyadenylation signal is usually characterized by 
affecting the addition of polyadenylic acid tracts to 
the 3' end of the mRNff precursor. 

the term "Transit Peptide" refers to the N-terminal 
extension of a protein that serves as a signal for 
25 uptake and transport of that protein into an organelle 
such as a plastid or mitochondrion. 

"Transformation" herein refers to the transfer of a 
foreign gene into the genome of a host organism and its 
genetically stable inheritance. "Restriction fragment 
30 length polymorphism" refers to different sized 

restriction fragment lengths due to altered nucleotide 
sequences in or around variant forms of genes . 
"Fertile" refers to plants that are able to propagate 
sexually. 
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"Oil-producing species" herein refers to plant 
species which produce and store triacylglycerol in 
specific organs, primarily in seeds- Such species 
include soybean {glycine max), rapeseed and canola 
5 (including Brassica nafiU&r fi. caffiPSStris) , sunflower 
( Helianthus annus) . cotton (GQSSypium hirSUtum) r corn 
(Ztt mays.) , cocoa f Thebbroma cacao) , saf flower 
{Carthamus tinctpr.iua) / oil palm (Elasis. guineensis) , 

coconut palm { Cocos nucifera) , flax (Linum 
10 usitatissimum ) , castor fflicinus communis) and peanut 
{ Arachis h ypoaaea ) . The group also includes non- 
agronomic species which are useful in developing 
appropriate expression vectors such as tobacco, rapid 
cycling Brassica species, and Arabidopsis thaliana, and 
15 wild species which may be a source of unique fatty , 
acids 

"Sequence-dependent protocols" refer to techniques 
that rely on a nucleotide sequence for their utility. 
Examples of sequence-dependent protocols include, but 

20 are not limited to, the methods of nucleic acid and 
oligomer hybridization and methods of DNA and RNA 
amplification such as are exemplified in various "uses of 
the polymerase chain reaction. "PCR product" refers to 
the DNA product obtained through polymerase chain 

25 reaction. 

Various solutions used in the experimental 
manipulations are referred to by their common names such 
as W SSC", "SSPE", "Denhardt's solution", etc. The 
composition of these solutions may be found by reference 

30 to Appendix B of Sambrook, et al. {Molecular Cloning, A 
Laboratory Manual, 2nd ed. (1989), Cold Spring Harbor 
Laboratory Press) . 



T-DNA Mutagenesis and Identification of an 
ArflbidOPSiS Mutant Defective in Delta-15 Dgsafnral-ion 
In T-DNA mutagenesis (Feldraann, et al., Science 
(1989) 243:1351-1354), the integration of T-DNA in the 
5 genome can interrupt normal expression of the gene at or 
near the site of the integration. If the resultant 
mutant phenotype can be detected and shown genetically 
to be tightly linked to the T-DNA insertion, then the 
"tagged" locus and its wild type counterpart can be 
10 readily isolated by molecular cloning by one skilled in 
the art. 

Arabidopsis .thaliana seeds were transformed by 
ftqrobacterium tumefaciens C58Clrif strain harboring the 
avirulent Ti-plasmid pGV3850 : :pAK1003 that has the T-DNA 
15 region between the left and right T-DNA borders replaced 
by the origin of replication region and ampicillin 
resistance gene of plasmid pBR322 f a bacterial kanamycin 
resistance gene, and a plant kanamycin resistance gene 
(Feldmann, et al., Mol. Gen. Genetics (1987) 208:1-9). 
20 Plants from the treated seeds were self-fertilized and 
the resultant progeny seeds, germinated in the presence 
of kanamycin, were self-fertilized to give rise to a 
population, designated T3, that was segregating for 
T-DNA insertions. T3 seeds from approximately 6000 T2 
plants were analyzed for fatty acid composition. One 
line, designated 3707, showed a reduced level of 
linolenic acid (18:3) . One more round of self- 
fertilization of mutant line 3707 produced T4 progeny 
seeds . The ratio of 18:2/18:3 in seeds of the 
homogyzous mutant in T4 population was ca. 14; this 
ratio is ca 1.8 and ca. 23, respectively, in wild-type 
Arabidopsis and Arabidopsis fad 3 mutant [Lemieux et al. 
(1990) Theor. App. Gen. 80:234-240 J obtained via 
chemical mutagenesis. These seeds were planted and 263 
individual plants were analyzed for the presence of 
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nopaline in leaf extracts. TS 'seeds from these plants 
were further analyzed for fatty acid composition and the 
ability to germinate in the presence of kanamycin. The 
mutant fatty acid phenotype was found to segregate in a 
5 1:2:1 ratio, as was germinability on kanamycin. 

Nopaline was found in all plants with an altered fatty 
acid phenotype, but not in wild type segregants. These 
results provided evidence that the locus controlling 
delta-15 desaturation was interrupted by T-DNA in mutant 
10 line 3707. 

Isolation of Arahirinp.; j ff Genomic DNA 
Containing the Genp rnnM-^^f no nAiha-i^ Dpsar „ rff 1 . ^ p 

In order to isolate the gene controlling delta-15 
desaturation from wild-type Arabirinp^ <, . a T-DNA-plant 
15 DNA "junction" fragment containing a T-DNA border 

integrated into the host plant DNA was isolated from 
Arabirton^ mutant 3707. For this, genomic DNA from the 
mutant plant was isolated and completely digested by 
either Bam HI or Sal I restriction enzymes. In each 
case, one of the resultant fragments was expected to 
contain the origin of replication and ampicillin- 
resistance gene of pBR322 as well as the left T-DNA- 
plant DNA junction fragment. Such fragments were 
rescued as plasmids by ligating the digested genomic DNA 
25 fragments at a dilute concentration to facilitate self- 
ligation and then using the ligated fragments to 
transform £. soli cells. Ampicillin-resistant £. coli 
transformants were isolated and screened by colony 
hybridization to fragments containing either the left or 
30 the right' T-DNA border. Of the 192 colonies obtained 
from the plasmid rescue of Sal I digested genomic DNA, 
31 hybridized with the left T-DNA border fragment, 4 
hybridized to the right T-DNA border fragment, and none 
hybridized to both. Of the 85 colonies obtained from 
the plasmid rescue of Bara HI digested genomic DNA, 63 
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hybridized to the left border and none to the right 
border. Restriction analysis of seven rescued plasmids 
that were obtained from the Bam HI digestion and that 
hybridized to the left T-DNA border showed that they 
were indistinguishable and contained 1.4 kb of putative, 
flanking plant DNA. Restriction analysis of another 
rescued plasmid, pSl, that was obtained from the Sal I 
digestion and hybridized only to the left T-DNA border, 
showed that it contained 2.9 kb of putative, flanking 
plant DNA. This flanking DNA had a Bam HI site and a 
Hind III site 1.4 kb and 2.2 kb, respectively, away from 
the left T-DNA border, suggesting that the 1.4 kb 
putative plant DNA in Bam HI rescued plasmids was 
contained within the 2.9 kb putative plant DNA in the 
15 Sal I rescued plasmids. Southern blot analysis of wild 
type and mutant 3707 toabidOBSls. genomic DNA using the 
radiolabeled 1.4 kb DNA fragment as the hybridization 
probe confirmed that this fragment contained plant DNA 
and that the T-DNA integration site was in a 2.8 kb 
20 Bam HI, a 5.2 kb Hind III, a 3.5 kb Sal I, a 5.5 kb 

Eco RI, and an approximately 9 kb Cla I fragment of wild 
type ai-ahiriopsis DNA. Nucleotide sequencing of plasmid 
pSl with a primer made to a left T-DNA border sequence 
revealed that pSl was colinear with the sequence of the 
25 left T-DNA border (Yadav et al. r Proc. Natl. Acad. Sci. 
USA (1982) 79:6322-6326) up to nucleotide position 65, 
which is in the T-DNA border repeats. Approximately 800 
bp of additional sequence in pSl beyond the T-DNA-plant 
DNA junction, that is, in the plant DNA adjoining the 
30 left T-DNA border, showed no significant homology to the 
T-DNA of pGV3850: :pAK1003 and no significant open 
reading frame. 

The nucleic acid fragment from wild-type 
*r*Mri«nsi.s corresponding to the plant DNA flanking 
35 T-DNA in the line 3707 was isolated by screening a 
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lambda phage Arahiriopsis thaliana genomic library with 
the 1.4 kb plant DNA isolated from the rescued plasmids 
as a hybridization probe. Seven positively-hybridizing 
genomic clones were isolated that fell in one of five 
5 classes based on partial restriction mapping. While 

their average insert size was approximately 15 kb, taken 
together they spanned a total of approximately 40 kb of 
genomic DNA, A combination of restriction and Southern 
analyses revealed that the five clones overlapped the 

10 site of integration of the left border of the T-DNA and 
that there was no detectable rearrangement of plant DNA 
in the rescued plasmids as compared to that in the wild 
type genomic plant DNA, One of these lambda phage 
clones, designated 1111, was representative of the 

15 recovered clones and contained an approximately 20 kb 
genomic DNA insert which was more or less symmetrically 
arranged around the site of insertion of the left border 
of the T-DNA. This clone was deposited on November 27, 
1991 with the American Type Culture Collection of 

20 Rockville, Maryland under the provisions of the Budapest 
Treaty and bears accession number ATCC 75167. 

Isolation of Arabidopsis Delta-15 
nftsar.wrase cDNA 
A 5.2 kb Hind III fragment containing wild-type 

25 genomic DNA, which hybridized to the 1 . 4 kb flanking 
plant DNA recovered from line 3707 and which was 
interrupted near its middle by the T-DNA insertion in 
line 3707 , was isolated from lambda phage clone 41A1 and 
cloned into the Hind III site of the pBluescript SK 

30 vector (Stratagene) by standard cloning procedures 
described in Sambrook et*al., Molecular Cloning, A 
Laboratory Manual, 2nd ed. (1989), Cold Spring Harbor 
Laboratory Press) . The resultant plasmid was designated 
pFl. The isolated 5.2 kb Hind III fragment was also 

35 used as a radiolabeled hybridization probe to screen a 
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cDNA library made to poly A + mRNA from 3-day-old 
etiolated ft ff? h ^ n P^ is thallana (ecotype Columbia) 
seedling hypocotyls in a lambda ZAP II vector 
(Stratagene) . Of the several positively-hybridizing 
5 plaques, four strongly-hybridizing ones were subjected 
to plaque purification. Sequences of the pBluescript 
(Stratagene) vector, including the cDNA inserts, from 
each of the purified phage stocks were excised in the 
presence of a helper phage. The resultant phagemids 
10 were used to infect £. coli cells which yielded double- 
stranded plasmids, pCFl, pCF2, pCF3, and pCF4. All four 
were shown to contain at least one approximately 1.3 to 
1.4 kb Not I insert fragment (Not I/Eco RI adaptors were 
used in the preparation of the cDNA library) which 

15 hybridized to the same region of wild-type plant genomic 
DNA present in the isolated phage clones- This region, 
which was near the site of integration of the left T-DNA 
border in line 3707, was on the side of the T-DNA 
insertion opposite to that of the plant DNA flanking the 

20 left T-DNA border isolated previously via plasmid 

rescue. Partial sequence determination of the different 
cDNAs revealed common identity. Since multiple versions 
of only one type of cDNA were obtained from a cDNA 
library made from etiolated tissue which is expected to 

25 express delta-15 desaturation, and since these cDNAs 
hybridized to the genomic DNA that corresponds to the 
site of T-DNA integration in line 3707 which had a high 
linoleic acid/low linolenic acid phenotype, Applicants 
were lead to conclude that the T-DNA in line 3707 

30 interrupted the normal expression of the gene encoding 
delta-15 desaturase. Th6 complete nucleotide sequence 
of one cDNA, designated pCF3 f was determined and is 
shown as SEQ ID NO:l. It reveals an open reading frame 
that encodes a 386 amino acid polypeptide- One of the 

35 sequencing primers made to the pCF3 insert was also used 
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to obtain 255 bp of sequence from pFl that is shown as 
SEQ ID NO: 3. Nucleotides 68 to 255 of the genomic DNA 
in pFl (SEQ ID NO: 3) are identical to nucleotides 1 to 
188 of the cDNA (SEQ ID N0:1), which shows that they are 
5 colinear and that the cDNA is encoded for by the gene in 
the isolated genomic DNA. Nucleotides 113 to 115 in SEQ 
ID NO: 3 are the initiation codon of the largest open 
reading frame corresponding to nucleotides 46-48 in SEQ 
ID NO:l. This is evident from the presence of in-frame 

10 termination codons at nucleotides 47 to 4 9 and 

nucleotides 56 to 58 and the absence of observable 
intron splice junctions in SEQ ID NO: 3. The 
identification of the 386 amino acid polypeptide as a 
desaturase was confirmed by comparing its amino acid 

15 sequence with all the protein sequences found in Release 
19,0 of the SWISSPROTEIN database using the FAST A 
algorithm of Pearson and Lipman (Proc. Natl. Acad. Sci. 
USA (1988) 85:2444-2448) and the BLAST program (Altschul 
et al., J. Mol. Biol. (1990) 215:403-410). The most 

20 homologous protein found in both searches was the desA 
fatty acid desaturase from the cyanobacterium 
Synechocystis PCC6803 (Wada, et al . , Nature (1990) 
347:200-203; Genbank ID:CSDESA; GenBank Accession 
No:X53508) . The 386 amino acid peptide in SEQ ID N0:1 

25 was also compared to the 35 L amino acid sequence of 

by the method of Needleman et al. (J. Mpl. Biol. (1970) 
48:443-453). Over their entire length, these proteins 
were 26% identical, the comparison imposing four major 
gaps in the desA protein sequence. While this overall 

30 homology is poor, homology in shorter stretches was 
better. For instance, in a stretch of 78 amino acids 
the Arahiriopsis delta-15 desaturase (amino acids 78 to 
155 in SEQ ID NO:l) and the dfifiA protein (amino acids 67 
to 144) showed 40% identity and 66% similarity. 
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Homology in yet shorter stretches was even greater as 



shown in Table 2. 

TABLE 2 

Peptide AA positions AA positions Percent 

Lzngth ™ nq:1 i n desA Ident i ty 

12 97-108 86-97 83 

7 115-121 104-110 71 

9 133-141 22-130 56 

11 299-309 282-292 64 



These high percent identities in short stretches of 
amino acids between the cyanobacterial desaturase 
5 polypeptide and SEQ ID NO: 2 suggests significant 
relatedness between the two. 

To analyse the developmental expression of the gene 
encoding mRNA coresponding to SEQ ID NO:l, the cDNA 
insert in plasmid pCF3 was used as a radiolabeled 

10 hybridization probe on mRNA samples from leaf, root, 

germinating seedling, and developing siliques from both 
wild type amd mutant 3707 Arahidopsis plants, 
essentially as described in Maniatis et al., Molecular 
Cloning, A Laboratory Manual (1982) Cold Spring Harbor 

15 Laboratory Press, The results indicated that while the 
mRNA corresponding to SEQ ID NO:l is detected in all 
tissues from the mutant plant, its levels are lower than 
in wild-type tissues. This is consistent with the 
observation that the fatty acid mutation in line 3707 is 

20 leaky relative to the known Arahidopsis lad 2 mutant 
obtained via chemical mutagenesis. These results 
confirmed that the T-DNA in line 3707 had interrupted 
the normal expression of a fatty acid desaturase gene. 
Based on the fatty acid phenotype of homozygous mutant 

25 line 3707, Applicants concluded that the cDNA insert in 
pCF3 encoded the delta-15 desaturase. Further, 
Applicants concluded rhat it was the microsomal delta-15 
desaturase, and not the chloroplastic delta-15 
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desaturase, since: a) the mutant phenotype was 
expressed strongly in the seed but expressed poorly, if 
at all> in the leaf of line 3707, and b) the delta-15 
desaturase polypeptide, by comparison to the dasA 
5 polypeptide, did not have an N-terminal extension of a 
transit peptide expected for a nuclear-encoded 
chloroplast desaturase . 

The identity of SEQ ID NO: 2 as the ftrabidopsis 
microsomal delta-15 desaturase was confirmed by its 

10 biological overexpression in plant tissues. For this, 

the 1.4 kB Not I fragment of plasmid pCF3 containing the 
delta-15 desaturase cDNA was placed in the sense 
orientation behind either the CaMV 35S promotor, to 
provide constitutive expression, or behind the promoter 

15 for the gene encoding soybean a' subunit of the 

P-conglycinin (IS) seed storage protein, to provide 

embryo-specific expression. The chimeric genes 35S 
promoter/sense SEQ ID NO: 1/3 1 nopaline synthase and 
P-conglycinin/sense SEQ ID NO: 1/3 ' phaseolin were then 

20 transformed into plant cells by ftgrobflCterium 

tumefaciens 's binary Ti plasmid vector system [Hoekema 
et al. (1983) Nature 303:179-180; Bevan (1984) Nucl. 
Acids Res. 12:8711-8720]. 

To confirm the identity of SEQ ID N0:1 and to test 

25 the biological effect of its overexpression in a 
heterologous plant species, the chimeric genes 35S 
promoter/sense SEQ ID NO: 1/3' nopaline synthase was 
transformed into a binary vector, which was then 
transferred into Aorobacterium Mimpfnriens strain R1000, 

30 carrying the Ri plasmid pRiA4b from ftgrobacterium 
rhizoaenes [Moore et al.* (1979) Plasmid 2:617-626). 
5 Carrot ( Daucus carota L.) cells were transformed by 

co-cultivation of carrot root disks with strain R1000 
' carrying the chimeric gene by the method of Petit et al. 

35 (1986) [Mol. Gen. Genet. 202:388-393). Fatty acid 
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analyses of transgenic carrot "hairy" roots show that 
overexpression of Arabidoosis microsomal delta-15 
desaturase can result in over 10-fold increase in 18:3 
at the expense of 18:2. 
5 To complement the delta-15 desaturation mutation in 

the T-DNA mutant line 3707 and to test the biological 
effect of overexpression of SEQ ID N0:1 in. seed, the 
embryo-specific promoter/SEQ ID NO: 1/3 » phaseolin 
chimeric gene was transformed into a binary vector, 

10 which was then transformed into the avirulent 

Agrobacterium strain LBA4404/pAL4404 [Hoekema et al. 
(1983) Nature 303:179-180]. Roots of line 3707 were 
transformed by th<> engineered ftgrohflftteriunir transformed 
plants were selected and grown to give rise to seeds. 

15 Fatty acid analysis of the seeds from two plants showed 
that the one out of six seeds in each plant showed the 
mutant fatty acid phenotype, while the remaining seeds 
show more than 10-fold increase in 18:3 to ca. 55%. 
While the sample size is small, this segregation 

20 suggests Mendelian inheritance of the fatty acid 

phenotype. While most of the increase occurs at the 
expense of 18:2, some of it also occurs at the expense 
of 18:1. Thus, overexpression of this gene in oils 
crops, especially canola, which is a close relative of 

25 ftrahirfopsis , is also expected to result in the high 
levels of 18:3 that are found in specialty oil of 
linseed. 

Comparisons of the sequence of the 386 amino acid 
polypeptide by the method of Needleman et al. (J. Mol. 

30 Biol.. (1970) 48:443-453) with those for the microsomal 
stearoyl-CoA (delta-9) dSsaturases from rat, mouse and 
yeast revealed 21%, 19%, and 17% identities, 
respectively. While the membrane-associated ftfab i tiQPS i s 
delta-15 desaturase protein showed significant but 

35 limited homology to the dfisA protein, it showed no 
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significant homology to the soluble stearoyl-ACP 
(delta-9) desaturases from higher plants, including one 
from flrabidopsis. 

Comparison of partial nucleotide sequences of 
5 plasmids pFl and pSl showed that the left T-DNA 
border: plant DNA junction is ca. 700 bp from the 
initiaton codon in SEQ ID NO:l. To determine the 
position of the other T-DNA: plant DNA junction with 
respect to the pFl sequence, the T-DNA: plant DNA 

10 junction fragment was isolated. Genomic DNA from mutant 
line 3.7.07, isolated as described previously, was 
partially digested by restriction enzyme Mbo I to give 
an average fragment size of ca. 15 kB. The fragment 
ends were partially-filled with dGTP and gATP by Klenow 

15 and cloned into Xho I half-sites of LambdaGEM®-ll 
(Promega Corporation) following the manufacturer's 
protocol. The phage library was titered and used 
essentially as described in Ausubel et al. [Current 
Protocols in Molecular Biology (1989) John Wiley 6 

20 Sons] . The genomic phage library was screened with 
radiolabeled PCR product, ca. 0.6 kB, derived from 5 ! 
end of the gene in pFl. This product spans from 3 bp to 
the right of where the left-T-DNA border inserted to 15 
bp to the left of nucleotide position 1 in SEQ ID NO:l. 

25 Southern blot analysis of DNA from one of the purified, 
positively^hybridizing phages following Eco RI 
restriction digestion and electrophoresis showed that a 
4 kB Eco RI fragment hybridized to the 0.6 kB PCR 
product. The Eco RI fragment was subcloned and subject 

30 to, sequence analyses. Comparison of the sequences 

derived from this fragment, pFl and pSl showed that the 
insertion of T-DNA resulted in a 56 bp deletion at the 
site of insertion and that the T-DNA interrupted the 
Arabidopsis gene 711 bp 5' to the initiaton codon in SEQ 

35 ID NO:l. Thus, the T-DNA inserts 5' to the open reading 
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frame, consistent with the leaky expresssion of the gene 
encoding SEQ ID N0:1 and the leaky fatty acid phenotype 
in mutant 3707. While the left T-DNA : p lant DNA junction 
is precise, that is without any sequence rearrangement 
5 in either the left T-DNA border or the flanking plant 
DtiA, the other T-DNA: plant DNA junction is complex and 
not fully characterized. 

Plasmid pCF3 was deposited on December 3, 1991 with 
the American Type Culture Collection of Rockville, 

10 Maryland under the provisions of the Budapest Treaty and 
bears accession number ATCC 68875, 

Using a rah irtoos is Delta-15 Desaturase cDNA as a 
Hybridization Probe to Isolate cDNAs Encoding 
Related Dftfiaturafi frfi-from Arabidopsis 

15 The 1.4 kb Not I insert fragment isolated from 

plasmid pCF3 was purified, radiolabeled, and used to 
screen approximately 80,000 clones from the cDNA library 
made to poly A + mRNA from 3-day-old etiolated 
Arabidopsis t-.ha -liana as described above, except that 

20 lower stringency hybridizations (1 M NaCl, 50 mM Tris- 
HC1, pH 7.5, 1% SDS, 5% dextran sulfate, 0.1 mg/mL 
denatured salmon sperm DNA and 50°C) and washes 
(sequentially with 2X SSPE, 0.1% SDS at room temperature 
for 5 min and then again with fresh solution for 10 min, 

25 and finally with 0.5X SSPE, 0.1% SDS at 50°C for 5 min.) 
were used. Approximately 17 strongly-hybridizing and 17 
weakly-hybridizing plaques were identified in the 
primary screen. Four of the weakly-hybridizing plaques 
were picked and subjected to one or two further rounds 

30 of screening with the radiolabeled probe as above until 
they were pure. To ensure that these were not delta-15 
desaturase clones, they were further analyzed to 
determine whether they hybridized to an 18 bp oligomer 
specific to the 3' non-coding region of delta-15 , 

35 desaturase cDNA (pCF3) . After autoradiography of the 
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filters, one of the clones was found not to hybridize to 
this probe. This clone was picked, and a plasmid clone 
containing the cDNA insert was obtained as described 
above. Restriction analysis of this plasmid, designated 
5 pCM2, showed that it had an approximately 1.3 kb cDNA 
insert which lacked a 0.7 kb Nco I - Bgl II fragment 
characteristic of the Arabidopsis deita-15 desaturase 
cDNA of pCF3. (This fragment corresponds to the DNA 
located between the Nco I site at nucleotides 474 to 479 

10 and the Bgl II site at nucleotides 1164 to 1169 in SEQ 
ID NO: 1) . Partial nucleotide sequences of single 
strands from the 5* region and 3' region of pCM2 
revealed that the cDNA insert was incomplete and that it 
encoded a polypeptide that is similar to, but distinct 

15 from, that encoded by the cDNA in pCF3. In order to 
isolate a full-length version of the cDNA in plasmid 
pCM2, the 1.3 kB Not I fragment from plasmid pCM2 
containing the cDNA insert was isolated and used as a 
radiolabeled hybridization probe to rescreen the same 

20 Arabidopsis cDNA library as above. Three strongly 
hybridizing plaques were purified and the plasraids 
excised as described previously. The three resultant 
plasmids were digested by Not I restriction enzyme and 
shown to contain cDNA inserts ranging in size between 1 

25 kB and 1.5 kB, Complete nucleotide sequence 

determination of the cDNA insert in one of these 
plasmids, designated pACF2-2, is shown in SEQ ID NO: 4. 
SEQ ID NO: 4 shows the 5' to 3' nucleotide sequence of 
base pairs of the Arabidopsis thalians cDNA which 

30 encodes a fatty acid desaturase. Nucleotides 10-12 and 
nucleotides 1358 to 1350 are, respectively, the putative 
initiation codon and the termination codon of the open 
reading frame (nucleotides 10 to 1350) . The open 
reading frame was confirmed by comparison of its deduced 

35 amino acid sequences with that of the related delta-15 
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fatty acid desaturase from soybean in this application. 
Nucleotides 1 to 9 and 1351 to 1525 are, respectively, 
the 5' and 3 1 untranslated nucleotides. The 446 amino 
acid protein sequence in SEQ ID NO: 5 is that deduced 
5 from the open reading frame in SEQ ID NO: 4 and has an 
estimated molecular weight of 51 kD. Alignment of SEQ 
ID N0S:2 and 5 shows an overall homology jof 
approximately 80% and that the former has an 
approximately 55 amino acid long N-terminal extension, 
10 which is deduced to be a transit peptide found in 
nuclear-encoded plastid proteins . 

To analyse the developmental expression of the gene 
corresponding to SEQ ID NO : 4 f this sequence was used as 
a radiolabeled hybridization probe on mRNA samples from 

15 leaf, root, germinating seedling, and developing 
siliques from both wild type and mutant line 3707 
Arahidopsis plants, essentially as described in Maniatis 
et al. [Molecular Cloning, A Laboratory Manual (1982) 
Cold Spring Harbor Laboratory Press] - The results 

20 indicated that', in contrast to the constitutive 

expression of the gene encoding SEQ ID NO:l, the mRNA 
corresponding to SEQ ID NO: 4 is abundant in green 
tissues, rare in roots and leaves, and is about three- 
fold more abundant in leaf than that of SEQ ID N0:1. 

25 The cDNA in plasmid pCM2 was also shown to hybridize 

pblymorphically to genomic dna from ftrflbidopsis tha li ana 
(ecotype Wassileskija and marker line W100 ecotype 
Landesberg background) digested with Eco RI. It was 
used as a RFLP marker to map the genetic locus for the 

30 gene encoding this fatty acid desaturase in Arabidopsis. 
A single genetic locus was positioned corresponding to 
this desaturase cDNA . Its location was thus determined 
to be on chromosome 3 between the lambda AT228 and 
cosmid c3838 RFLP markers, "north" of the glabrous locus 

35 (Chang et al., Proc. Natl. Acad. Sci. USA (1988) 
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85:6856-6860; Nam et al., Plant Cell <1989) 1:699-705). 
This approximates the region to which Arabidopsis fatty 
acid desaturase fad 2 . fad and fad B mutations map 
[Somerville et al., (1992) in press]. Unsuccessful 
5 efforts to clone the microsomal delta-12 fatty acid 
desaturase using cDNA inserts of pCF3 and pACF2-2 
alongwith the above data led Applicants to conclude that 
the cDNA in pACF2-2 encodes a plastid delta-15 fatty 
acid desaturase that corresponds to the fad D locus- 

10 This conclusion will be confirmed by biological 
expression of the cDNA in pACF2-2. 

Plasmid pCM2 was deposited on November 27, 1991 
with the American Type Culture Collection of Rockville, 
Maryland under the provisions of the Budapest Treaty and 

15 bears accession number ATCC 68852. 

The 1.4 kb, 1.3 kB, and 1.5 kB Not I cDNA insert 
fragments isolated from plasmids pCF3, pCM2 and pACF2-2 
were purified, radiolabeled, and used several times to 
screen at low stringency as described above two 

20 different cDNA libraries: one was made to poly A + mRNA 
from 3-day-old etiolated Arahidopsis tha liana 
("etiolated" library) as described above and one made to 
polyA + mRNA from the above-ground parts of Arabidopsis 
tha liana plants, which varied in size from those that 

25 had just opened their primary leaves to plants which had 
bolted and were flowering {Elledge et al. (1991) Proc. 
Natl. Acad Sci. USA 88:1731-1735]. The cDNA inserts in 
the library were made into an Xho I site flanked by Eco 
RI sites in lambda Yes vector [Elledge et al, (1991) 

30 Proc. Natl. Acad Sci. USA 88:1731-1735] ("leaf" 

library) . Several plaques from both libraries that 
hybridized weakly and in duplicate lifts to both SEQ ID 
NOS:l and 4 were subjected to plaque purification. 
Phagemids were excised from the pure phages from 

35 "etiolated" library as described above. Plasmids were 
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excised from the purified phages of the "leaf" library 
by site^specif ic recombination using the cre-lox 
recombination system in £. CQlx strain BNN132 [Elledge 
et al. (1991) Proc* Natl. Acad Sci. USA 88:1731-1735]. 
5 In all cases, nucleotide sequencing of the cloned DMA 
revealed clones either identical to SEQ ID N0S:1 or 4 or 
unrecognizable sequences . 

In another set of experiments ca. 400,000 phages in 
the "leaf" library was screened with SEQ ID N0S:1 and 4 
10 at low stringency (26 C, 1 M Na+, 50%\formamide) and 

high stringency (42 C, 1 M Na+, 50% formamide) . Of the 
several positive signals on the primary plaque lifts, 11 
showed high stringency hybridization to SEQ ID N0:1, 35 
showed high stringency hybridization to SEQ ID NO: 4, and 
15 39 hybridized to both at low stringency only. Twenty 

seven plaques of the low stringency signals came through 
a secondary low-stringency screen, 17 of which were used 
to make DNA from excised plasmids . Of the 7 plasmid DNA 
were sequenced, 8 were unrecognizable sequences, 5 were 
20 identical to SEQ ID NO:l, 2 were identical to SEQ ID 
NO: 2, and 2 were identical to one another and related 
but distinct to SEQ ID NOS:l and 4. The novel 
desaturase sequence, designated pFad-x2, was also 
isolated from the "leaf" library independently by using 
25 as a hybridization probe a 0.6 kB PGR product derived by 
polymerase chain reaction on poly A+- RNA made from both 
canola seed as well as Arabidopsis leaves, as described 
elsewhere in this application, using degenerate 
oligomers made to conserved sequences between plant 
30 delta-15 desaturases and the cyanobacterial des. A 

desaturase. The PCR-derived plasmid, designated pYacp7, 
was sequenced partially from both ends. Comparison of 
the sequences of pFad-x2 and pYacp7 revealed that the 
two independently cloned cDNAs contained an identical 
35 sequence that was related to the other delta-15 
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desaturases and that both were incomplete cDNAs. A 
partial composite sequence derived from both plasmids, 
pFadx-2 and pYacp7, is shown in SEQ ID NO: 16 as a 5' to 
3' nucleotide sequence of 472 bp. Nucleotides 2-4 and 
5 nucleotides 468 to 470 are, respectively/ the first and 
the last codons in the open reading frame. This open 
reading frame is shown in SEQ ID NO: 17. Comparison of 
SEQ ID NO: 17 to the other delta-15 desaturase 
polypeptides disclosed in this application by the method 

10 of Needleman et al. [J* Mol. Biol. (1970) 48:443-453) J 
using gap weight and gap length weight values of 3.0 and 
Oil, respectively. The overall identities are between 
65% and 68% between SEQ ID NO: 17 and the microsomal 
delta- 15 desaturases from Arabidopsis. canola and 

15 soybean and the overall identities are between 77% and 
87% between SEQ ID NO: 17 and the plastid delta-15 
desaturases from Arabidopsis. canola and soybean. In 
addition SEQ ID NO: 17 has ah N-terminal peptide 
extension compared to the microsomal delta-15 

20 desaturases that shows homology of the transit peptide 
sequence in Arabidopsis plastid delta-15 desaturase. On 
the basis of these comparisons it is deduced that SEQ ID 
NO: 16 encodes a plastid delta-15 desaturase. There is 
genetic data in Arabidopsis suggesting the presence of 

25 two loci for plastid delta-15 desaturase. The full- 
length version of SEQ ID NO: 16 can be readily isolated 
by one skilled in the art. The biological effect of 
introducing SEQ ID NO: 16 or its full-length version into 
plants will be used to confirm its identity. 

30 Flasmid pYacp7 was deposited on 20 November 1992 

with the American Type Culture Collection of Rockville, 
Maryland under the provisions of the Budapest Treaty and 
bears accession number ATCC 69129. 
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Using Arahvrinpsis Delta- 15 Desaturase cDNAs 
as Hybridization Probes to Isolate 

Dfrsa-fcnraae rDNAs from Other Plant Soeeies 
For the purpose of cloning the Brass ica HOCUS, seed 
S cDNAs encoding delta^lS fatty acid desaturases, the cDNA 
inserts from pCF3 and pGM2 were isolated by polymerase 
chain reaction from the respective plasmids r 
radiolabeled, and used as hybridization probes to screen 
a lambda phage cDNA library made with poly A + mRNA from 
10 developing Brassica napilS. seeds 20-21 days after 

pollination . This cDNA library was screened several 
times at low stringency, using the Arabidopsis cDNA 
probes mentioned above* One of the Brasssica nagus. 
cDNAs obtained in the initial screens was used as probe 
15 in a subsequent high stringency screen. 

Arabidopsis pCM2 insert was radiolabeled and used 
as probe to screen approximately 300,000 plaques under 
low stringency hybridization conditions . The filter 
hybridizations were performed in 50 mM Tris pH 7.6, 6X 
20 SSC, 5X Denhardt's, 0*5% SDS r 100 ug denatured calf 

thymus DNA at S0°C overnight, and the posthybridization 
washes were carried out in 6X SSC, 0.5% SDS at room 
temperature for 15 min, then repeated with 2X SSC, 0*5% 
SDS at 45°C for 30- rain, and then repeated twice with 
25 0.2X SSC, 0.5% SDS at 50°C for 30 rain. Five strongly- 
hybridizing phages were obtained. These were plaque 
purified and used to excise the phagemids as described 
in the manual of the pBluescriptll Phagemid Kit from 
Stratagene (Stratagene 1991 catalogue, item 212205) . 
30 One of these, designated pBNSF3-2, contained a 1.3 kb 
insert. pBNSF3-f2 was sequenced completely on both 
strands and the nucleotide sequence is shown in SEQ ID 
NO: 6. Plasmid pBNSF3-2 was deposited on 27 November 
1991 with the American Type Culture Collection of 
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Rockville Maryland, USA under the provisions of the 
Budapest Treaty and bears the accession number 68854. 

An additional low stringency screen using pCM2 
probe provided eight strongly hybridizing phages. One 
5 of these, designated pBNSFd 8 f contained a 0.4kb insert. 
pBNSFd-r8 was sequenced completely on one strand, this 
nucleotide sequence showed significant divergence from 
the sequence SEQ ID NO: 6 in the homologous region, which 
suggested that it corresponded to a novel Brass tea napus 

10 seed desaturase different from that shown in SEQ ID 
NO: 6. pBNSFd-8 insert was radiolabelled and used as 
hybridization probe in a high stringency screen of the 
Brassica ^ap us seed cDNA library. The hybridization 
conditions were identical to those of the low stringency 

15 screen described above except for the temperature of the 
final two 30 min posthybr idi zat ion washes in 0.2x SSC, 
0.5% SDS was increased to 60°C. This screen resulted in 
three strongly hybridizing phages that were purified and 
excised. One of the excised plasinids pBNSFd-3 contained 

20 a 1.4kb insert that was sequenced completely on both 
strands. SEQ ID NO: 8 shows the complete nucleotide 
sequence of pBNSFd-2. 

Using Arabidoosis Delta-15 Desaturase cDNA as a 
Hybridization Probe to Isolate a Glycerolipid 

25 Desaturas e r:DNA from Soybean 

A cDNA library was made to poly A + mRNA isolated 
from developing soybean seeds, and screened essentially 
as described above $ except that filters were 
prehybridized in 25 mL of hybridization buffer 

30 consisting of 50mM Tris-HCl, pH 7.5, 1 M NaCl, 1% SDS, 
5% dextran sulfate and 0-. 1 mg/mL denatured salmon sperm 
DNA (Sigma Chemical Co.) at 50°C for 2 h r Radiolabeled 
probe prepared from pCF3 as described above was added, 
and allowed to hybridize for 18 h at 50°C. The probes 

35 were washed twice at room temperature with 2X SSPE, 1% 
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SDS for five min followed by washing for 5 min at 50°C 
in 0.2X SSPE, 1% SDS. Autoradiography of the filters 
indicated that there was one strongly hybridizing 
plaque , and approximately five weakly hybridizing 
5 plaques. The more strongly hybridizing plaque was 
subjected to a second round of screening as before, 
except that the final wash was for 5 min at 60°C in 0.2X 
SSPE, 1% SDS . Numerous, strongly hybridizing plaques 
were observed, and one, well-isolated from other phage, 
10 was picked for further analysis. 

Sequences of the pBluescript vector from the 
purified phage, including the cDNA insert, were excised 
in the presence of a helper phage and the resultant 
phagemid was used to infect XL-1 Blue cells, 

15 DNA from the plasmid, designated pXFl, was made by the 
alkaline lysis miniprep procedure described in Sambrook 
et al. (Molecular Cloning, A Laboratory Manual, 2nd ed- 
(1989) Cold Spring Harbor Laboratory Press) . The 
alkali-denatured double-stranded DNA from pXFl was 
20 completely sequenced on both strands. The insert of 
pXFl contained a stretch of 1783 nucleotides which 
contained an unknown open-reading frame and also 
contained a poly-A stretch of 16 nucleotides 3 f to the 
open reading frame, from nucleotides 1767 to 1783, 
25 followed by an Eco RI restriction site. The 2184 bases 
that followed this Eco RI site contained a 1145 bp open 
reading frame which encoded a polypeptide of about 68% 
identity to, and colinear with, the ftrabidopsis delta-15 
desaturase polypeptide listed in SEQ ID No:2. The 
30 putative start methionine of the 1145 bp open-reading 
frame corresponded to the start methionine of the 
Arahidopsis microsomal delta-15 peptide and there were 
no amino acids corresponding to a plastid transit 
peptide 5' to this methionine. When the insert in pXFl 
35 was digested with Eco RI four fragments were observed, 
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fragments of approximately 370 bp and 1400 bp fragments, 
derived from the first 1783 bp of the insert in pXFl, 
and fragments of approximately 600 bp and 1600 bp 
derived from the the other 2184 nucleotides of the 
5 insert in pXFl. Only the 600 bp and 1600 bp fragments 
hybridized with probe derived from pCF3 on Southern 
blots. It was deduced that pXFl contained two different 
cDNA inserts separated by an Eco RI site and the second 
of these inserts was a 2184 bp cDNA encoding a soybean 
10 microsomal delta- 15 desaturase. The complete nucleotide 
sequence of the 2184 bp soybean microsomal delta- 15 cDNA 
contained in plasmid pXFl is listed in SEQ ID No: 10. 
Plasmid pXFl was deposited on December 3, 1991 with the 
American Type Culture Collection of Rockville, Maryland 
15 under the provisions of the Budapest Treaty and bears 
accession number ATCC 68874 . 
Using Soybean Microsomal Delta-15 Desaturase cDNA as a 
Hybridization Probe to Isolate 
cDNAs Encoding Related Desaturases from Soybean 
20 A 1.0 kb fragment of DNA corresponding to part of 

the coding region of the. soybean microsomal delta-15 
desaturase cDNA contained in plasmid pXFl, was excised 
with the restriction enzyme Hha I and gel purified. The 
fragment was labeled with 32 P as described above and 
25 used to probe a soybean cDNA library as described above. 
Autoradiography of the filters indicated that there were 
eight hybridizing plaques and these were subjected to a 
second round of screening. Sequences of the pBluescript 
vector from all eight of the purified phages, including 
30 the cDNA inserts, were excised in the presence of a 
helper phage and the resultant phagemids were used to 
infect E. coli XL-1 Blue cells . DNA from the plasmids 
was made by the alkaline lysis miniprep procedure 
described in Sarabrook et al. (Molecular Cloning/ A 
35 Laboratory Manual, 2nd ed. (1989) Cold Spring Harbor 
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Laboratory Press) * Restriction analysis showed they 
contained inserts ranging from 1.0 kb to 3.0 kb in size. 
One of these inserts, designated pSFD-118bwp, contained 
an insert of about 1700 bp. The alkali-denatured 
5 double-stranded DNA from pSFD-118bwp was completely 
sequenced on both strands, shown in SEQ ID NO: 12. The 
insert of pSFD-118bwp contained a stretch of 1675 
nucleotides which contained an open-reading frame 
encoding a polypeptide/ shown in SEQ ID NO: 13, of about 

10 80% identity with, and colinear with, the ArabidQPSis 

plastid delta-15 desaturase polypeptide listed in SEQ ID 
No; 5. The open- reading frame also encoded amino acids 
corresponding to a plastid transit peptide at the 5 f end 
of the open-reading frame. The transit peptide was 

15 colinear with, and shared some homology to, the transit 
peptide described for the Arabidopsis plastid delta-15 
glycerblipid desaturase. The complete nucleotide 
sequence of the 1675 bp soybean plastid delta-15 
glycerolipid desaturase cDNA is listed in SEQ ID No: 12. 

20 Comparison of the different delta-15 desaturase 

sequences disclosed in the application by the method of 
Needleman et al. (J. Mol. Biol. (1970) 48:443-453) using gap 
weight and gap length weight values of 3.0 and 0.1, 
respectively, reveals the relatedness between them as shown 
25 in Table 3. 
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Percent Identities Between Different Delta- 15 
Fatty Acid Desatu rases at the Amino Acid Level 
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Si 
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a3 
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aD 




67 
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69 




c3 






68 


68 


68 




CD 








68 


74 




a3, 


ad, c3, 


cD, s3 and sD 


refer, 


respectively, t 



SEQ ID NO: 2 ( Arahidopsis microsomal delta-15 
desaturase), SEQ ID NO: 5 (ftrflfridQPSis plastid delta-15 



5 desaturase), SEQ ID NO:7 (canola microsomal delta-15 
desaturase), SEQ ID NO: 9 (canola plastid delta-15 
desaturase), SEQ ID N0:11 (soybean microsomal delta-15 
desaturase), and SEQ ID NO: 13 (soybean plastid delta-15 
desaturase) . Based on these comparisons, the delta-15 

10 desaturases, of both microsomal and plastid types, have 
overall identities of 65% or more at the amino acid 
levels, even when from different plant species. 

Isolation of Nucleotide Sequences Encoding 
Homologou s and Heterologous Glycerol loid Desaturases 

15 Fragments of the instant . invention may be used to 

isolate cDNAs and genes of homologous and heterologous 
glycerolipid desaturases from the same species as the 
fragment of the invention or from different species:. 
Isolation of homologous genes using sequence-dependent 

20 protocols is well-known in the art. Southern blot 
analysis revealed that the Arahidopsis microsomal 
delta-15 desaturase cDNA (SEQ ID N0:1) hybridized to 
genomic DNA fragments of .corn and soybean. In addition, 
Applicants have demonstrated that it can be used to 

25 isolate cDNAs encoding seed microsomal delta-15 

desaturases from Brassica napus (SEQ ID NO: 6) and 
soybean (SEQ ID NO:10) . Thus, one can isolate cDNAs and 
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genes for homologous glycerolipid desaturases from the 
same or different higher plant species, especially from 
the oil-producing species. 

More importantly, one can use the fragments of the 
5 invention to isolate cDNAs and genes for heterologous 
glycerolipid desaturases, including those found in 
plastids. Thus, Arabidopsis microsomal delta-15 
desaturase cDNA (SEQ ID N0:1) was successfully used as a 
hybridization probe to isolate cDNAs encoding the 

10 related plastid delta-15 desaturases from Arabidopsis 

(SEQ ID NO: 4) and Bras a tea uaEUS (SEQ ID NO: 8) , and the 
soybean microsomal delta-15 soybean (SEQ ID NO: 10) was 
successfully used to isolate soybean cDNA encoding 
plastid delta-15 desaturase (SEQ ID NO: 12). 

15 m a particular embodiment of the present 

invention, regions of the nucleic acid fragments of the 
invention that are conserved between different 
desaturases may be used by one skilled in the art to 
design a mixture of degenerate oligomers for use in 

20 sequence-dependent protocols aimed at isolating nucleic 
acid fragments encoding other homologous or heterologous 
glycerolipid desaturase cDNA's or genes- For example, 
by comparing all desaturase polypeptides one can 
identify stretches of amino acids that are conserved 

25 between them, and then use the conserved amino acid 

sequence to design oligomers, both short degenerate or 
long ones, or "guessmers" as known by one skilled in the 
art (see Sambrook et al., (Molecular Cloning, A 
Laboratory Manual, 2nd ed. (1989), Cold Spring Harbor 

30 Laboratory Press). Such oligomers and* "quessmers" may 
be used as hybridization -probes as known to one skilled 
in the art. 

For example, comparison of cyanobacterial desA and 
plant delta-15 desaturases revealed a particularly well 
35 conserved stretch of amino acids (amino acids 97-108 in 
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SEQ ID N0:1). SEQ ID NOS:20 and 21 represent two sets 
of 3 6-mers each 16-fold degenerate flriade to this region. 
End- labeled oligomers represented in SEQ ID NOS:20 and 
21 were mixed and used as hybridzation probes to screen 
5 Arabidopsis cDNA libraries. Most of the positively- 
hybridising plaques also hybridized to cDNAs encoding 
Arabidopsis microsomal and plastid delta- 15 desaturases 
(SEQ ID N0S:1 and 4). However, the use of SEQ ID NOS:20 
and 21 did not give consistent and reproducible results. 

10 A 135 base^long oligomer (SEQ ID NO: 32) was also made as 
an antisense strand to a longer stretch of the same 
conserved region, amino acids 97 to 141 in SEQ ID NO:l 
(FVLGHDCGHGSFSDIPLLNSVVGHILHSFILVPYHGWRISHRTHH) . At 
positions of ambiguity, the design used either 

15 deoxyinosines or most frequently used codons based on 
the codon usage in Arabidopsis genes. When used as a 
hybridization probe, the 135-mer hybridized to all 
plaques that also hybridized to cDNAs encoding 
Arabidopsis microsomal and plastid delta- 15 desaturases 

20 (SEQ ID NOS:l and 4). In addition, it also hybridized 
to plaques that did not hybridize to SEQ ID NOS:l and 
4) . The latter were purified and excised as described 
previously,. Nucleotide sequencing of the cDNA inserts 
in the resultant plasraids revealed DNA sequences that 

25 did not show any relatedness to any desaturase. 

For another example , in the polymerase chain 
reaction (Innis, et al., Eds, (1990) PCR Protocols: A 
Guide to Methods and Applications, Academic Press, San 
Diego) , two short pieces of the present fragment of the 

30 invention can be used to amplify a longer glycerolipid 
desaturase DNA fragment from DNA or RNA. The polymerase 
chain reaction may also be performed on a library of 
cloned nucleotide sequences with one primer based on the 
fragment of the invention and the other on either the 

35 poly A + tail or a vector sequence. These oligomers may 
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be unique sequences or degenerate sequences derived from 
the nucleic acid fragments of the invention. The longer 
piece of homologous glycero lipid desaturase DNA 
generated by this method could then be used as a probe 
5 for isolating related glycerolipid desaturase genes or 
cDNAs from Arahidopsis or other species. The design of 
oligomers, including long oligomers using deoxyinosine, 
and "guessroers" for hybridization or for the polymerase 
chain reaction are known to one skilled in the art and 
10 discussed in Sambrook et al., (Molecular Cloning, A 
Laboratory Manual, 2nd ed. (1989), Cold Spring Harbor 
Laboratory Press) . Stretches of conserved amino acids 
between delta-15 desaturase and other desaturases, 
especially desA , allow for the design of such oligomers. 
15 For example, conserved stretches of amino acids between 
desA and delta-15 desaturase, discussed above, are 
useful in designing long oligomers for hybridization as 
well as shorter ones for use as primers in the 
polymerase chain reaction. In this regard, the 

20 conserved amino acid stretch of amino acids 97 to 108 of 
SEQ ID NO: 2 is particularly useful. Other conserved 
~ regions in SEQ ID NO: 2 useful for this purpose are amino 
acids 299 to 309, amino acids 115 to 121, and amino 
acids 133 to 141. Amino acid stretch 133 to 141 in SEQ 

25 ID NO: 2 shows especially good homology to several 

desaturases. For example* in this stretch, amino acids 
133, 137, 138, 140 and 141 are conserved in plant 
delta-15 desaturases, cyanobacterial dfiSAr yeast and 
mammalian microsomal stearoyl-CoA desaturases. - 

30 Comparison of cyanobacterial des A and plant delta-15 
desaturases revealed two -particularly well conserved 
stretch of amino acids (amino acids 97-108 and amino 
acids 299-311 in SEQ ID NO:l) that can be used for PCR. 
The following sets of PCR primers were made to these 

35 regions: 
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SEQ 
TP WO 


Length 


Fold 
Detrenerary 


AA positions 
in 

seo rn NO: 2 


AA 5>prnif»nro 


20 


36 


16 


97-108 <S> 


FVLGHDCGHGSF 


21 


36 


16 


97-108 (S) 


FVLGHDCGHGSF 


28 


36 


16 


97-108 (S) 


FVLGHDCGHGSF 


29 


36 


16 


97-108 (S). 


FVLGHDCGHGSF 


22 


18 


72 


100-105 (S) 


GHDCGH 


23 


18 


72 


100-105 (S) 


GHDCGH 




lfi 


72 


20Q_^AA fact 


HDIGTrt 


25 


18 


72 


299-304 (AS) 


HDIGTH 


216 


23 


416 


304-309 (AS) 


HVTHHL 


27 


23 


416 


304-309 (AS) 


HVIHHL 


30 


38 


64 


299-311 (AS) 


HDIGTHV1HHLFP 


31 


38 


64 


299-311 (AS) 


HDIGTHVIHHLFP 



In one experiment , PCRs were performed using SEQ ID 
N03:22 and 23 as sense primers and either SEQ ID NOS:24 
and 25 or SEQ ID NOS:26 and 27 as antisense primers on 
poly A+ RNA purified from both Arabidopsis leaf and 
5 cariola developing seeds. All PCRs resulted in PCR 
products of the correct size (ca. 630 bp) , The PCR 
products from Arabidopsis and canola were purified and 
used as radiolabeled hybridization probes to screen the 
Lambda Yes Arabidopsis cDNA library, as described above. 

10 This led to the isolation of a pure phage, which was 
excised to give plasmid pYacp7. The cDNA insert in 
pYacp7 was partially sequenced- It's sequence showed 
that it encoded an incomplete desaturase polypeptide 
that was identical to another cDNA (in plasmid pFadx-2) 

15 isolated by low-stringency hybridization as described 
previously. The composite sequence derived from the 
partial sequences from the cDNA inserts in pFadx-2 and 
pYacp7 is shown in SEQ ID NO: 16 and the polypeptide 
encoded by it in SEQ ID NO: 17. As discussed previously, 

20 SEQ ID NO: 17 is a putative plastid delta-15 desaturase. 
This is further supported by Southern blot analysis 
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using radiolabeled cDNA inserts from either pCF3, 
PACF2-2, or pYacp7 on Arabidopsis genomic DNA digested 
with one of several enzymes . It shows that the 
different inserts hybridize to different restriction 
5 fragments and that only the inserts from pACF2-2 and 
pYacp7 show some dross -hybridization . 

In another PCR experiment, PCR was performed using 
ca. 80 pinoles each of SEQ ID NOS:28 and 29 as sense 
primers and ca. 94 pmoles each of SEQ ID NOS:30 and 31 
10 as antisense primers on poly Af RNA purified from 

Arabidopsis mutant line 3707. This was performed using 
GeneAmp® RNA PCR Kit (Perkin Elmer Cetus) following 
manufacturer's protocol and using the following program: 
a) 1 cycle of 2 min at 95°C, b) 35 cycles of 1 min at 
15 95°C (denaturation) , 1 min at 50°C (annealing) and 1 min 
at 65°C (extension), and c) 1 cycle of 7 min at 65°C. 
The resulting PCR product, of the correct size (ca. 630 
bp), was purified, radiolabeled, and used as a 
hybridization probe on a Southern blot of Arabidopsis 
20 genomic DNA as described above. While it hybridized to 
restriction fragments that also hybridized to SEQ ID 
N0S:1 (Arabidopsis microsomal delta- IS desaturase) , 4 
(Arabidopsis plastid delta-15 desaturase) , and 16 
(Arabidopsis plastid delta-15 desaturase), it also 
25 hybridized to novel fragments that did not hybridze to 
previously cloned desaturase cDNAs. However, even after 
several attempts, the radiolabeled PCR product did not 
hybridize to any hovel cDNA clone when used as a probe 
on different Arabidopsis cDNA libraries: in all cases 
30 ithybridzed only to plaques that also hybridized to the 
known desaturase cDNAs. Furthermore, the PCR product 
was subcloned into a plasmid vector and after screening 
about a 100 of these, none gave rise to a clone with a 
novel desaturase sequence. 
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The isolation of other glycerolipiri desaturases 
will become easier as more examples of glycerolipid 
desaturases are isolated using the fragments of the 
invention. Knowing the conserved amino acid sequences 
5 from diverse desaturases will also allow one to identify 
more and better consensus sequences. Such sequences can 
be used to make hybridization probes or amplification 
primers which will further aid in the isolation of 
different glycerolipid desaturases, including those from 
10 non-plant sources such as fungi, algae, and even 

cyanobacteria, as well as other membrane^associated 
desaturases from other organisms. 

The function of the diverse nucleotide fragments 
encoding glycerolipid desaturases that can be isolated 
15 using the present invention can be identified by 
transforming plants with the isolated desaturase 
sequences, linked in sense or antisense orientation to 
suitable regulatory sequences required for plant 
expression, and observing the fatty acid phenotype of 
20 the resulting transgenic plants. Preferred target 

plants for the transformation are the same as the source 
of the isolated nucleotide fragments when the goal is to 
obtain inhibition of the corresponding endogenous gene 
by antisense inhibition or cosuppression . Preferred 
25 target plants for use in expression or overexpression of 
the isolated nucleic acid fragments are plants with 
known mutations in desaturation reactions, such as the 
ftrabidopsis desaturase mutants, mutant flax deficient in 
delta-15 desaturation, or mutant sunflower deficient in 
30 delta-12 desaturation. Alternatively, the function of 
the isolated nucleic acid fragments can be determined 
similarly via transformation of other organisms, such as 
yeast or cyanobacteria, with chimeric genes containing 
the nucleic acid fragment and suitable regulatory 
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sequences followed by analysis of fatty acid composition 
and/or enzyme activity, 

Overexpression of the Glycerolipid 
Desaturase Enzyme s in Transgenic Species 

5 The nucleic acid fragment (s) of the instant 

invention encoding functional glycerolipid 
desaturase (s) , with suitable regulatory sequences, can 
be used to overexpress the enzyme (s) in transgenic 
organisms. Such recombinant DNA constructs may include 
10 either the native glycerolipid desaturase gene or a 

chimeric glycerolipid desaturase gene isolated from the 
same or a different species as the host organism. For 
overexpression of glycerolipid desaturase (s) , it is 
preferable that the introduced gene be from a different 
15 species to reduce the likelihood of cosuppression. For 
example, overexpression of delta-15 desaturase in 
soybean, rapeseed, or other oil-producing species to 
produce altered levels of polyunsaturated fatty acids 
may be achieved by expressing RNA from the entire cDNA 
20 found in pCF3. Similarly, the isolated nucleic acid 
fragments encoding glycerolipid desaturases from 
Arabidopsis r rapeseed, and soybean can also be used by 
. one skilled in the art to obtain substantially 
homologous full-length cDNAs, if not already obtained, 
25 as well as the corresponding genes as fragments of the 
invention. These, in turn, may be used to overexpress 
the corresponding desaturases in plants. One skilled in 
the art can also isolate the coding sequence (s) from the 
fragment (s) of the invention by using and/or creating 
30 sites for restriction endonucleases, as described in 
Sambrdok et al., (Molecular Cloning, A Laboratory 
Manual, 2nd ed. (1989) , Cold Spring Harbor Laboratory 
Press). For example, the fragment in SEQ ID NO:l in 
plasmid pCF3 is flanked by Not I sites and can be 
35 isolated as a Not I fragment that can be introduced in 
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the sense orientation relative to suitable plant 
regulatory sequences. Alternatively, sites for Nco I 
(S'-CCATGC-O 1 ) or Sph I (5 1 -GCATGC-3 1 ) that allow 
precise removal of coding sequences starting with the 
5 initiating codon w ATG tt may be engineered into the 
fragment (s) of the invention. For example, for 
utilizing the coding sequence of delta-15 desaturase 
from pCF3, an Sph I site can be engineered by 
substituting nucleotides at positions 44, 45, and 49 of 

10 SEQ ID NO : 1 with G, C, and C, respectively. 

Inhibition of Plant Target 
ftftnes by Use of Antisense RNA 
Antisense RNA has been used to inhibit plant target 
genes in a tissue-specific manner (see van der Krol et 

15 al., Blotechniques (1988) 6:958-976). Antisense 

inhibition has been shown using the entire cDNA sequence 
(Sheehy et al., Proc. Natl. Acad. Sci. USA (1988) 
85:8805-8809) as well as a partial cDNA sequence (Cannon 
et al., Plant Molec. Biol. (1990) 15:39-47). There is 

20 also evidence that the 3' non-coding sequences (Ch'ng 
et al., Proc. Natl. Acad. Sci. USA (1989) 
86:10006-10010) and fragments of 5» coding sequence, 
containing as few as 41 base-pairs of a 1.87 kb cDNA 
(Cannon et al., Plant Molec. Biol. (1990) 15:39-47), can 

25 play important roles in antisense inhibition. 

The use of antisense inhibition of the glycerolipid 
desaturases may require isolation of the transcribed 
sequence for one or more target glycerolipid desaturase 
genes that are expressed in the target tissue of the 

30 target plant. The genes that are most highly expressed 
are the best targets for* antisense inhibition. These 
genes may be identified by determining their levels of 
transcription by techniques, such as quantitative 
analysis of mRNA levels or nuclear run-off 

35 transcription, known to one skilled in the art. 
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For example, antisense inhibition of delta-15 
desaturase in ftrassica napus resulting in altered levels 
of polyunsaturated fatty acids may be achieved by 
expressing antisense RNA from the entire or partial cDNA 
5 found in pBNSF3-2 . 

Inhibition of Plant Target 
fr>n«*s hy Cosuppression 
The phenomenon of cosuppression has also been used 
to inhibit plant target genes in a tissue-specific 
10 manner. Cosuppression of an endogenous gene using the 
entire cDNA sequence (Napoli et al., The Plant Cell 
(1990) 2:279-289; van der Krol et al., The Plant Cell 
(1990) 2:291-299) as well as a partial cDNA sequence 
(730 bp of a 1770 bp cDNA) (Smith et al., Mol. Gen. 
15 Genetics (1990) 224:477-481) are known. 

The nucleic acid fragments of the instant invention 
encoding glycerolipid desaturases, or parts thereof/ 
with suitable regulatory sequences, can be used to 
reduce the level of glycerolipid desaturases, thereby 
20 altering fatty acid composition, in transgenic plants 
which contain an endogenous gene substantially 
homologous to the introduced nucleic acid fragment. The 
experimental procedures necessary for this are similar 
to those described above for the overexpression of the 
25 glycerolipid desaturase nucleic acid fragments. For 
example, cosuppression of delta-15 desaturase in 
Brass tea napus resulting in altered levels of 
polyunsaturated fatty acids 0 may be achieved by 
expressing in the sense orientation the entire or 
30 partial seed delta-15 desaturase cDNA found in pBNSF3-2. 
fiAiAflMoq Host s . Promoters and Enhancers 
A preferred class of heterologous hosts for the 
expression of the nucleic acid fragments of the 
invention are eukaryotic hosts, particularly the cells 
35 of higher plants . Particularly preferred among the 
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higher plants are the oil-producing species, such as 
soybean f Glygine max) , rapeseed (including Rrflfifi i ca 
napus r a. namp^Rfris ) , sunflower (Hfil i an thus annus ) , 
cotton fGossypium lursiitum) r corn tZsa mays) * cocoa 
5 / Thpnhroma cacao l. saf flower fCarthamus tinctorius), oil 
palm f Riaeis giHh^nsis> , coconut palm (fiacas. nucifera ) , 
flax f Linum n«Hf mum) , and peanut (Arachlfi 

hypogaga) * 

Expression in plants will use regulatory sequences 

10 functional in such plants. The expression of foreign 
genes in plants is well-established (De Blaere et al., 
Meth. Enzymol. (1987) 153:277-291). The source of the 
promoter chosen to drive the expression of the fragments 
of the invention is not critical provided it has 

15 sufficient transcriptional activity to accomplish the 

invention by increasing or decreasing, respectively, the 
level of translatable roRNA for the glycerolipid 
desaturases in the desired host tissue. Preferred 
promoters include (a) strong constitutive plant 

20 promoters, such as those directing the 19S and 35S 

transcripts in cauliflower mosaic virus (Odell et al., 
Nature (1985) 313:810-812; Hull et al Virology (1987) 
86:482-493), and (to) tissue- or developmentally-specific 
promoters. Examples of tissue-specific promoters are 

25 the light- inducible promoter of the small subunit of 

ribulose 1 , 5-bis-phosphate carboxylase (if expression is 
desired in photosynthetic tissues) , the maize zein 
protein promoter (Matzke et al.,- EMBO J. (1984) 
3:1525-1532), and the chlorophyll a/B binding protein 

30 promoter (Lampa et al., Nature (1986) 316:750-752) . 

Particularly preferred promoters are those that 
allow seed-specific expression. This may be especially 
useful since seeds are the primary source of vegetable 
oils and also since seed-specific expression will avoid 

35 any potential deleterious effect in non-seed tissues. 
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Exanples of seed-specific promoters include, but are not 
limited to, the promoters of seed storage proteins, 
which can represent up to 90% of total seed protein in 
many plants. The seed storage proteins are strictly 
5 regulated, being expressed almost exclusively in seeds 
in a highly tissue-specific and stage-specific manner 
(Higgins et al., Ann. Rev. Plant Physiol. (1984) 
35:191-221; Goldberg et al., Cell (1989) 56:149-160). 
Moreover, different seed storage proteins may be 
10 expressed at different stages of seed development. 

Expression of seed-specific genes has been studied 
in great detail (See reviews by Goldberg et al.. Cell 
(1989) 56:149-160 and Higgins et al./ Ann. Rev. Plant 
Physiol. (1984) 35:191-221). There are currently 

15 numerous examples of seed-specific expression of seed 
storage protein genes in transgenic dicotyledonous 
plants. These include genes from dicotyledonous plants 
for bean b-phaseolin (Sengupta^Gopalan et al., Proc. 
Natl. Acad. Sci. USA (1985) 82:3320-3324; Hoffman et 

20 al., Plant Mol. Biol. (1988) 11:717-729), bean lectin 
(Voelker et al., EMBO J. (1987) 6:3571-3577), soybean 
lectin (Okamuro et al., Proc. Natl. Acad. Sci. USA 
(1986) 83:8240-8244), soybean Kunitz trypsin inhibitor 
(Perez-Grau et al., Plant Cell (1989) 1:095-1109), 

25 soybean b-conglycinin (Beachy et al., EMBO J. (1985) 
4:3047-3053; pea vicilin (Higgins et al., Plant Mol. 
Biol. (1988) 11: 683-695) , pea convicilin (Newbigin et 
al., Planta (1990) 180:461-470), pea legumin (Shirsat et 
al., Mol. Gen. Genetics (1989) 215:326-331) ; rapeseed 

30 napin (Radke et al., Theor. Appl. Genet. (1988) 

75:685-694) as well as genes from monocotyledonous 
plants such as for maize 15 kD zein (Hoffman et al., 
EMBO J. (1987) 6:3213-3221), maize 18 kb oleosin (Lee at 
al., Proc. Natl. Acad. Sci. USA (1991) 888:6181-6185), 

35 barley b-hordein (Marris et al., Plant Mol. Biol. (1988) 
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10:359-366) and wheat glutenin (Colot ec al., EMBO J. 
(1987) 6:3559-3564). Moreover, promoters of seed- 
specific genes operably linked to heterologous coding 
sequences in chimeric gene constructs also maintain 
5 their temporal and spatial expression pattern in 
transgenic plants. Such examples include use of 
Rrahidopsis i-hal tana 2S seed storage protein gene 
promoter to express enkephalin peptides in Arab i dops i s 
and fi. napus seeds (Vandekerckhove et al., 

10 Bio/Technology (1989) 7:929-932), bean lectin and bean 
b-phaseoiin promoters to express luciferase (Riggs et 
al., Plant Sci. (1989) 63:47-57), and wheat glutenin 
promoters to express chloramphenicol acetyl transferase 
(Colot et al., EMBO J. (1987) 6:3559-3564). 

15 of particular use in the expression of the nucleic 

acid fragment of the invention will be the heterologous 
promoters from several soybean seed storage protein 
genes such as those for the Kunitz trypsin inhibitor 
(Jofuku et al., Plant Cell (1989) 1:1079-1093; glycinin 

20 (Nielson et al., Plant Cell (198ft) " 1:313-328) , and 
b-conglycinin (Harada et al., Plant Cell (1989) 
1:415-425). Promoters of genes for a- and b-subunits of 
soybean {J-conglycinin storage protein will be 
particularly useful in expressing the mRNA or the 

25 antisense RNA in the cotyledons at mid- to late-stages 
of seed development (Beachy et al., EMBO J. (1985) 
4:3047-3053) in transgenic plants. This is because 
there is very little position effect on their expression 
in transgenic seeds, and the two promoters show 

30 different temporal regulation. The promoter for the 
a-subunit gene is expressed a few days before that for 
the b-subunit gene. This is important for transforming 
rapeseed where oil biosynthesis begins about a week 
before seed storage protein synthesis (Murphy et al., J. 

35 Plant Physiol. (1989) 135:63-69). 
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Also of particular use will be promoters of genes 
expressed during early embryogenesis and oil 
biosynthesis. The native regulatory sequences, 
including the native promoters, of the glycerolipid 
5 desaturase genes expressing the nucleic acid fragments 
of the invention can be used following their isolation 
by those skilled in the art. Heterologous promoters 
from other genes involved in seed oil biosynthesis, such 
as those for fi. naoua isocitrate lyase and malate 

10 synthase <Comal et al., Plant Cell (1989) 1:293-300), 

delta-9 desaturase from saf flower (Thompson et al. Proc. 
Natl. Acad. Sci. USA (1991) 88:2578-2582) and castor 
(Shanklin et al., Proc. Natl. Acad. Sci. USA (1991) 
88:2510-2514), acyl carrier protein (ACP) from 

15 ai^MrtiiDsis (Post-Beittenmiller et al., Nucl. Acids Res. 
(1989) 17:1777), fi. napJlS. (Safford et al., Eur. J. 
Biochem. (1988) 174 : 287-295) , and fi. rnmpftS-tf i a (Rose et 
al., Nucl. Acids Res. (1987) 15:7197), b-ketoacyl-ACP 
synthetase from barley (Siggaard-Andersen et al., Proc. 

20 Natl. Acad. Sci. USA (1991) 88:4114-4118), and oleosin 
f rom ifiijnayi (Lee et al., Proc. Natl. Acad. Sci. USA 
(1991) 88:6181-6185), soybean (Genbank Accession No: 
X60773) and*, naEUS. (Lee et al., Plant Physiol. (1991) 
96:1395-1397) will be of use. If the sequence of the 

25 corresponding genes is not disclosed or their promoter 
region is not identified, one skilled in the art can use 
the published sequence to isolate the corresponding gene 
and a fragment thereof containing the promoter. The 
partial protein sequences for the relatively-abundant 

30 enoyl-ACP reductase and acetyl-GoA carboxylase are also 
published (Slabas et al.-, Biochim. Biophys. Acta (1987) 
877:271-280; Cottingham et al., Biochim. Biophys. Acta 
(1988) 954:201-207) and one skilled in the art can use 
these sequences to isolate the corresponding seed genes 

35 with their promoters. Similarly, the fragments of the 
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present invention encoding glycerolipid desaturases can 
be used to obtain promoter regions of the corresponding 
genes for use in expressing chimeric genes. 

Attaining the proper level of expression of the 
5 nucleic acid fragments of the invention may require the 
use of different chimeric genes utilizing different 
promoters. Such chimeric genes can be transferred into 
host plants either together in a single expression 
vector or sequentially using more than one vector. 

10 It is envisioned that the introduction of enhancers 

or enhancer-like elements into the promoter regions of 
either the native or chimeric nucleic acid fragments of 
the invention will result in increased expression to 
accomplish the invention. This would include viral 

15 enhancers such as that found in the 35S promoter (Odell 
et al., Plant Mol. Biol. (1988) 10:263-272), enhancers 
from the opine genes (Fromm et al., Plant Cell (1989) 
1:977-984), or enhancers from any other source that 
result in increased transcription when placed into a 

20 promoter operably linked to the nucleic acid fragment of 
the invention. 

Of particular importance is the DNA sequence 
element isolated from the gene for the a-subunit of 
b-conglycinin that can confer 40-fold seed-specific 

25 enhancement to a constitutive promoter (Chen et al., 

Dev. Genet. (1989) 10:112-122). One skilled in the art 
can readily isolate this element and insert it within 
the promoter region of any gene in order to obtain seed- 
specific enhanced expression with the promoter in 

30 transgenic plants. Insertion of such an element in any 
seed-specific gene that is expressed at different times 
than the b-conglycinin gene will result in expression in 
transgenic plants for a longer period during seed 
development. 
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The invention can also be accomplished by a variety 
of other methods to obtain the desired end. In one 
form, the invention is based on modifying plants to 
produce increased levels of glycerolipid desaturases by 
5 virtue of introducing more than one copy of the foreign 
gene containing the nucleic acid fragments of the 
invention- In some cases, the desired level of 
polyunsaturated fatty acids may require introduction of 
foreign genes for more than one kind of glycerolipid 

10 desaturase. 

Any 3 f non-coding region capable of providing a 
polyadenylation signal and other regulatory sequences 
that may be required for the proper expression of the 
nucleic acid fragments of the invention can be used to 

15 accomplish the invention. This would include 3' ends of 
the native glycerolipid desaturase (s) , viral genes such 
as from the 35S or the 19S cauliflower mosaic virus 
transcripts, from the opine synthesis genes, ribulose 
1,5-bisphosphate carboxylase, or chlorophyll a/b binding 

20 protein. There are numerous examples in the art that 

teach the usefulness of different 3' non-coding regions. 

Tranftfor tnatinn Methods 

Various methods of transforming cells of higher 
plants according to the present invention are available 

25 to those skilled in the art (see EPO Pub. 0 295 959 A2 
and 0 318 341 Al) . Such methods include those based on 
transformation vectors utilizing the Ti and Ri plasmids 
of & ff noh a Hi-grium It is particularly preferred to 

use the binary type of these vectors. Ti-derived 

30 vectors transform a wide variety of higher plants, 

including monocotyledonous and dicotyledonous plants 
(Sukhapinda et al., Plant MOl. Biol. (1987) 8:209-216; 
Potrykus, Mol. Gen. Genet. (1985) 199:183). Other 
transformation methods are available to those skilled in 

35 the art, such as direct uptake of foreign DNA constructs 
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(see EPO Pub. 0 295 959 A2) , techniques of 
electroporation (Fromm et al. f Nature (1986) (London) 
319:791) or high-velocity ballistic bombardment with 
metal particles coated with the nucleic acid constructs 
5 (Kline et al., Nature (1987) (London) 327:70) . Once 
transformed, the cells can be regenerated by those 
skilled in the art. 

Of particular relevance are the recently described 
methods to transform foreign genes into commercially 

10 important crops, such as rapeseed (De Block et al., 
Plant Physiol. (1989) 91:694-701), sunflower (Everett 
et al., Bio/Technology (1987) 5:1201), and soybean 
(Christou et al., Proc. Natl. Acad. Sci USA (1989) 
86:7500-7504. 

15 Application to RFL? Technology 

The use of restriction fragment length polymorphism 
(RFLP) markers in plant breeding has been well- 
documented in the art (tanks ley et al., Bio/Technology 
(1989) 7:257-264). The nucleic acid fragments of the 

20 invention can be used as RFLP markers for traits linked 
to expression of glycerolipid desaturases. These traits 
will include altered levels of unsaturated fatty acids. 
The nucleic acid fragment of the invention can also be 
used to isolate the glycerolipid desaturase gene from 

25 variant (including mutant) plants with altered levels of 
unsaturated fatty acids. Sequencing of these genes will 
reveal nucleotide differences from the normal gene that 
cause the variation. Short oligonucleotides designed 
around these differences may be used as hybridization 

30 probes to follow the variation in polyunsaturates. 

Oligonucleotides based on differences that are linked to 
the variation may be used as molecular markers in 
breeding these variant oil traits. 
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EXAMPLES 

The present invention is further defined in the 
following Examples, in which all parts and percentages 
are by weight and degrees are Celsius, unless otherwise 
5 stated. It should be understood that these Examples, 
while indicating preferred embodiments of the invention, 
are given by way of illustration only. From the above 
discussion and these Examples, one skilled in the art 
can ascertain the essential characteristics of this 

10 invention, and without departing from the spirit and 
scope thereof, can make various changes and 
modifications of the invention to adapt it to various 
usages and conditions. All publications, including 
patents and non-patent literature, referred to in this 

15 specification are expressly incorporated by reference 
herein. 

EXAMPLE 1 

ISOLATION OF GENOMIC DNA FLANKING THE T-DNA SITE OF 
TNSEKTTON TN ARABTDOPSIS THAT . TANA MUTANT LINE 37Q7 
20 Identification of an Arahidopsis thaliaria T-DNA Mutant 
uith Low T.lnolgnifi Acid Content 
A population of Arahidopsis tvhfi liana (geographic 
race Wassilewski ja) transformants containing the T-DNA 
of Aorrobagterium tllfflftfagiens was generated by seed 
25 transformation as described by Feldmann et al., (Mol. 
Gen. Genetics (1987) 208:1-9). In this population the 
transformants contain DNA sequences encoding the pBR322 
bacterial vector, nopaline synthase, neomycin 
phosphotransferase (NPTII, confers kanamycin 
30 resistance) , and b-lactamase (confers ampicillin 

resistance) within the T-DNA border sequences* The 
integration of the T-DNA into different areas of the 
chromosomes of individual transformants may cause a 
disruption of plant gene function at or near the site of 
35 insertion, and phenotypes associated with this loss of 
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gene function can be analyzed by screening the 
population for the phenotype. 

T3 seed was generated from the wild type seed 
treated with A^ohacterium t.UIMVfaciens by two rounds of 
5 self-fertilization as described by Feldmann et al., 
(Science (1989) 243:1351-1354). These progeny were 
segregating for the T-DNA insertion, and thus for any 
mutation resulting from the insertion. Approximately 
100 seeds of each of 6000 lines were combined and the 

10 fatty acid content of each of the 6000 pooled samples 
was determined by gas chromatography of the fatty acyl 
methyl esters essentially as described by Browse et al., 
(Anal. Biochem. (1986) 152:141-145) except that 2.5% 
H2SO4 in methanol was used as the methylation reagent 

15 and samples were heated for 1.5 h at 80°C to effect the 
methanolysis of the seed triglycerides. A line 
designated "3707 " produced seeds that gave an altered 
fatty acid profile compared to that of the total 
population. T3 plants were grown from individual T3 

20 seeds produced by line 3707 and self-fertilized to 

produce T4 seeds on individual plants that were either 
homozygous wild type, homozygous mutant, or heterozygous 
for the mutation. The percent fatty acid compositions 
of a representative subsample of the entire population, 

25 of the pooled 3707 T3 seeds, and of a homozygous T4 
mutant segregant are shown in Table 4. 
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T3 Pools from 
lines 3S01-4000 

Fatty Acid average and 3707 T3 3707 Homozygous 

^hyi ftrrt deviation) Eaol T4 Sftgrag a nt 

palmitic 7.4 (0.37) 7.0 C.4 

stearic 3.0 (0.22) 2.9 3.0 

oleic 17.0 (1.5) 17.7 15.9 

linoleic 29.3 (0.78) 35.0 42.4 

linolenic <*-l> 10 ^ 2 3 * 1 

eicoaenoic 20.2 (0.73) 20.5 23.6 

The phenotype of the segregating T3 pool of line 3707 
(high linoleic acid, low linolenic acid) was 
intermediate between that of the population subsample 
5 and the homozygous T4 mutant seeds suggesting that line 
3707 harbored a mutation at a locus which controls the 
conversion of linoleic to linolenic acid in the seed. 
Still, it was not apparent whether the mutant phenotype 
in line 3707 was the result of a T-DNA insertion. 

10 Therefore, Applicants checked a segregating T4 

population to determine whether the mutant fatty acid 
phenotype cosegregated with the nopaline synthase 
activity and kanamycin resistance encoded by the T-DNA 
insert. A total of 263 T4 plants were grown and assayed 

15 for the presence of nopaline in leaf extracts 

(Errampalli et al., The Plant Cell (1991) 3:149-157) . 
in addition, T5 seeds were collected from each of the T4 
plants and samples of 10-50 seeds were taken to 
determine the seed fatty acid composition and to 

20 determine their ability to germinate in the presence of 
kanamycin (Feldmann, et al., (1969) Science 
243:1351-1354). The 263 plants fell into 3 classes as 
in Table 5. 
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TARLfl 5 

Number of 

Individuals Phfinntvp* 

63 T4 plants: little or no n opaline present; T5 
aeeds: wild type fatty acid composition, all 
kanamycin sensitive 

134 T4 plants: nopaline present; T5 seeds: 

heterozygous fatty acid composition similar 
to 3707 T3 pool, segregating for kanamycin 
resistance 

64 T4 plants: nopaline present; T5 seeds 
homozygous mutant fatty acid composition, all 

r kanamycin resistant 

The cosegregatiori of the fatty acid phenotype with the 
phenotypes conferred by T-DNA sequences in an 
approximately 1:2:1 pattern provided strong evidence 
5 that the mutation in line 3707 was the result of a T-DNA 
insertion. Further experiments were then conducted with 
the intent of using probes containing T-DNA sequences to 
clone the T-DNA insert and flanking genomic DNA from 
line 3707. 

10 Preparation of Genomic DNA from Homo^Yymi.g 3707 Plant- <t 
Seeds from a homozygous line derived from 
Arabiriop<H r thalia-na (geographic race Wassilewski ja 
(WS) ) line 3707 were surface sterilized for 5 min at 
room temperature in a solution of 5.25% sodium 

15 hypochlorite (w/v)/0.15% Tween 20 (v/v) , then washed 
several times in sterile distilled water, with a final 
rinse in 50% ethanol. Immediately following the ethanol 
wash, the seeds were transferred to sterile filter paper 
to dry. One to three seeds were then transferred to 

20 250-mL flasks containing 50 mL of sterile Gamborgs B5 
media {Gibco, 500-1153EA)-, pH 6.0. Cultures were 
incubated at 22°C, 70 HE • /nr 2 • sec" 1 of continuous light 
for approximately three weeks, after which time the root 
tissue was harvested, made into 10 g aliquots (wet 

25 weight), lyophilized, and stored at -20°C. 
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Using a variation of the procedure of Shure et al., 
(Cell (1983) 35:225-233) genomic DNA was isolated from 
the root tissue. Two aliquots.of lyophilized tissue 
were ground to a fine powder using a mortar and pestle . 
5 The ground tissue was added to a flask containing 85 mL 
of lysis buffer (7 M urea, 0.35 M NaCl, O.OS M Tris-HCl, 
pH 8.0, 0.02 M EDTA, 1% Sarkosyl, 5% phenol) and mixed 
gently with a glass rod to obtain a homogeneous 
suspension. To this suspension an equal volume of 

10 phenol: chloroform: isoamyl alcohol (25:24:1) 

(equilibrated with 10 mM Tris, pH 8, 1 mM EDTA) was 
added. After the addition of 8.5 mL of 10% SDS the 
mixture was swirled on a rotating platform for 15 min at 
room temperature. After centrifugation at 2000xg for 15 

15 min, the upper aqueous phase was removed to a new tube 
and extracted two more times, as above, but without the 
addition of SDS. To the final aqueous phase was added 
l/20th the volume of 3 M potassium acetate, pH 5.5 and 
two times the volume of ice cold 100% ethanol. 

20 Precipitation of the DNA was facilitated by incubation 
at -20°C for one hour followed by centrifugation at 
12,000xg for 10 min. The resulting pellet was 
resuspended in 3 mL of 10 mM Tris, P H 8, 1 mM EDTA to 
which was added 0.95 g of cesium chloride (CsCl) and 

25 21.4 yLL of 10 mg/mL ethidium bromide (EtBr) per mL of 
solution:" The DNA was then purified by centrifugation 
to equilibrium in a CsCl/EtBr density gradient for 16 h 
at 15°C, 265,000xg. After removal from the gradient, 
the DNA was extracted with isopropanol saturated with IE 

30 buffer (10 mM Tris, pH 8; 1 mM EDTA) and CsCl to remove 
EtBr and then dialyzed overnight at 4°C against 10 mM 
Tris, pH 8, 1 mM EDTA to remove CsCl. The DNA was 
removed from dialysis and the concentration was 
determined using the Hoechst fluorometric assay in which 

35 an aliquot of DNA is added to 3 mL of 1.5 X 10~ e M bis- 
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benzimide (Hoechst 33258, Siga) in IX SSC (0,15 M NaCI, 
0.015 M sodium citrate), pH 7.0, incubated at room 
temperature for 5 rain, and read on a fluorometer at 
excitation 360, emission 450, against a known set of DNA 
5 standards . 

Plasmid Rescue and Analysis 
Five micrograms of genomic DNA from the homozygous 
3707 mutant , prepared as described above, was digested 
with 20 units of either Bam HI or Sal I restriction 

10 enzyme (Bethesda Research Laboratory) in a 50 J1L 
reaction volume according to the manufacturer's 
specifications. After digestion the DNA was extracted 
with buffer-saturated phenol (Bethesda Research 
Laboratory) followed by precipitation in ethanol. The 

15 resulting pellet was resuspended in a final volume of 10 
flL of 10 mM Tris, pH 8, and the concentration of the DNA 
was determined using the Hoechst fluorometric assay as 
above . 

To facilitate circularization, as opposed to end- 

20 to-end joining, a dilute ligation reaction was set up 
containing 250 ng of Bam HI or Sal I digested genomic 
DNA, 3 Weiss units of T4 DNA ligase (Promega) , 50 JIL of 
10X ligase buffer (30 mM Tris-HCl, pH 7.8, 100 mM MgCl2, 
100 mM DTT, 5 mM ATP) and 5 ^IL of 100 mM ATP in a 500 J1L 

25 reaction volume. The reaction was incubated for 16 h at 
16°C, heated for 10 min at 70°C, and extracted once with 
buffer saturated phenol (Bethesda Research Laboratory) . 
The DNA was then precipitated with the addition of two 
volumes of 100% ethanol and 1/lOth volume of 7.5 M 

30 ammonium acetate. The resulting pellet was resuspended 
in a final volume of 10 ^IL of 10 mM Tris, pH 8, and the 
concentration of the DNA was determined using the 
Hoechst fluorometric assay as above. 

Competent DH10B cells (Bethesda Research 

35 Laboratory) were transfected with 50 ng of ligated DNA 
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at a concentration of 10 ng of DNA per 100 flL of cells 
according to the manufacturer's specifications. 
Transformants from Sal I or Bam HI digests were selected 
on LB plates (10 g Bacto-tryptone, 5 g Bacto-yeast 
5 extract, 5 g NaCl, 15 g agar per liter, pH 7.4) 

containing 100 [ig/mL ampicillin or 25 jig/mL kanamycin 
sulfate, respectively, Ampicillin-resistant (Amp r ; 
ampicillin sensitivity, Amp 3 ) Sal I tranformants were 
screened for the presence of the kanamycin resistance 
10 (Kan r ; kanamycin sensitivity, Kan 3 ) gene by picking 
primary tranformants and stabbing them first to LB 
plates containing 100 jlg/mL ampicillin then to LB plates 
containing 25 jLg/mL kanamycin- After overnight 
incubation at 37°C the plates were scored for Amp r /Kan s 

15 colonies. Kanamycin-resistant « Bam HI transformants were 
screened for the presence of the ampicillin resistance 
gene by picking primary transformants and stabbing them 
first to LB plates containing 25 Jlg/mL kanamycin and 
then to LB plates containing 100 p.g/mL ampicillin. 

20 After overnight incubation at 37°C the plates were 
scored for Kan r /Amp r colonies. 

Cultures were made of 192 Amp r /Kan 3 Sal I 
transformants and 85 Kan r /Amp r Bam HI transformants 
directly into deep-well microtiter plates containing 

25 200 JiL of LB broth (10 g Bacto-tryptone, 5 g Bacto-yeast 
extract, 5 g NaCl per liter) with 100 Jlg/mL ampicillin. 
Using the Schleicher and Schuell Minifold I apparatus 
and Nytran membranes, dot blots were set up, in 
duplicate, using the following conditions: 50 \XL of 

30 culture was diluted into 150 J1L of 5X SSC, the culture 
was lysed and the DNA denatured by the addition of 
150 JtL of 0.5 M NaOH, 1.5 M NaCl solution for 3 min at 
room temperature, the filter was removed from the 
apparatus and neutralized in 0.5 M Tris, pH 8, 1.5 M 

35 NaCl, the DNA was then UV cross-linked to the filters 
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using the Stratagene Stratalinker, and the filters were 
heated for 2 h at 80°C and stored at room temperature. 

To determine whether T-DNA was contained within any 
of the rescued plasmids, the dot blots were probed with 
5 portions of the right and left borders of T-DNA. The 
right border probe consisted of a 2.2 kb Hind III-Dra I 
fragment of DNA obtained from plasmid H23pKC7 (composed 
of the 3.2 kb Hind III 23 fragment from Ti plasmid 
pTiC58 (Lemmers et al., J. Mol. Biol. (1989) 

10 144;353-376) cloned into plasmid vector pKC7 (Maniatis 
et al., Molecular Cloning/ A Laboratory Manual (1982) 
Cold Spring Harbor Laboratory Press) ) , and the left 
border probe consisted of a 2.9 kb Hind III-Eco RI 
fragment obtained from plasmid H10pKC7 (composed of the 

15 6.5 kb Hind III 10 fragment from Ti plasmid pTiC58 
(Lemmers et al., J. Mol. Biol. (1989) 144:353-376) 
cloned into plasmid vector pKC7 (Maniatis et al., 
Molecular Cloning, A Laboratory Manual (1982) Cold 
Spring Harbor Laboratory Press)) using standard 

20 digestion, electrophoresis, and electroelution 

conditions as described in Sambrook et al., (Molecular 
Cloning, A Laboratory Manual, 2nd ed (1989) Cold Spring 
Harbor Laboratory Press) . Final DNA purification was 
obtained by passage of the eluted DNA over an Elutip-D 

25 column (Schleicher and Schuell) using the manufacturer's 
specifications. Concentration of the DNA was determined 
using the Hoechst fluorometric assay as above. 
Approximately 100 ng of each probe was labeled with 
a[ 32 P)dCTP using a Random Priming Kit from Bethesda 

30 Research Laboratories under conditions recommended by 
the manufacturer. Labeled probe was separated from 
unincorporated a[ 32 P]dCTP by passing the reaction 
through a Sephadex G-25 spun column under standard 
conditions as described in Sambrook et al., (Molecular 
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(ix) FEATURE: 



AAG GAT TGC TGG GTT AAG AAT CGA TGG AAG TCT ATG ACT TAC GTC GTC 
Lys His Cys Trp Vai Lys Asn Pro Trp Lys Ser Met Ser Tyr Val Val 
rr- in 75 80 



65 



96 



(A) NAME/KEY: CDS 

(B) LOCATION: 1..1215 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: Q 

TTC AAA TTC AGA CAA TCC CCT TCT TCT CCC CGG TTT CGT CTG AAC TCT 48 
Phe Lys Phe Arg Gin Ser Pro Ser Ser Pro Arg Phe Arg Leu Asn Ser 

15 10 15 ' f 

CGA AAC TGG GCG TTG AAT GTA ACC ACA CCT CTA ACA GTC GAC TCC TCA 
Arg Asn Trp Ala Leu Asn Val Thr Thr Pro Leu Tar Val Asp Ser Ser 
20 25 30 

TCA TCT CCT CCA ATC GAG GAA GAA CCC AAA ACG CAG AGA TTC GAC CCA 144 
Ser Ser Pro Pro He Glu Glu Glu Pro Lys Thr Gin Arg Phe Asp Pro 
35 40 45 

GGG GCT CCT CCT CCG TTC AAC CTA GCT GAC ATC AGA GCG GCG ATA CCT 192 
Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp He Arg Ala Ala He Pro 
50 55 60 



240 



288 



AGA GAG CTA GCC ATC GTG TTC GCA CTA GCT GCT GGA GCT GCT TAC CTC 
Arg Glu Leu Ala He Val Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu 
* 85 90 95 

AAC AAT TGG CTT GTT TGG CCT CTC TAT TGG ATT GCT CAA GGA ACC ATG 336 
Asn Asn Trp Leu Val Trp Pro Leu Tyr Trp He Ala Gin Gly Thr Met 
100 105 HO 



TTC TGG GCT CTC TTT GTT CTT GGC CAT GAC TGT GGA CAT GGA AGC TTC 384 
Phe Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe 
115 120 125 

TCA AAT GAT CCG AGG TTG AAC AGT GTG GTG GGT CAC CTT CTT CAT TCC 432 
Ser Asn Asp Pro Arg Leu Asn Ser Val Val Gly His Leu Leu His Ser 
130 135 I* 0 

TCT ATT CTA GTC CCT TAC CAT GGC TGG AGA ATT AGC CAC AGA ACT CAC 
Ser He Leu Val Pro Tyr His Gly Trp Arg He Ser His Arg Thr His 
145 150 155 1«° 

CAC CAG AAC CAT GGA CAT GTT GAG AAC GAT GAA TCT TGG CAT CCT ATG 
His Gin Asn His Gly His Val Glu Asn Asp Glu Ser Trp His Pro Met - 
165 . "0 175 

TCT GAG AAA ATC TAC AAG AGT TTG GAC AAA CCC ACT CGG TTC TTT AGA 576 
Ser Glu Lys lie Tyr Lys Ser Leu Asp Lys Pro Thr Arg Phe Phe Arg 
180 185 190 

TXT ACA TTG CCT CTC GTG ATG CTC GCT TAC CCT TTC TAC TTG TGG GCA 624 
Phe Thr Leu Pro Leu Val Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala 
195 200 205 



480 



528 
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AGA AGT CCA GGG AAG AAG GGT TCT CAT TAC CAT CCA GAC AGC GAC TTG 672 
Arg Ser Pro Gly Lys Lys Gly Ser His Tyr His Pro Asp Ser Asp Lou 
210 215 220 

TTC CTT OCT AAA GAG AGA AAC GAT GTT CTC ACT TCT ACC GCT TGT TGG 720 
Phe Leu Pro Lys Glu Arg Asn Asp Val Leu Thr Ser Thr Ala Cys Trp 
225 230 235 240 

ACT GCA ATG GCT GTT CTG CTT GTC TGT CTC AAC TTC GTG ATG GrGT CCA 768 
Thr Ala Met Ala Val Leu Leu Val Cys Leu Asn Phe Val Met Gly Pro 
245 250 255 

ATG CAA ATG CTC AAA CTT TAT GTC ATT CCT TAC TGG ATA AAT GTA ATG 816 
Met Gin Met Leu Lys Leu Tyr Val He Pro Tyr Trp He Asa Val Met 
260 265 270 

TGG TTG GAC TTT GTG ACT TAC CTG CAT CAC CAT GGT CAT GAA GAT AAG 864 
Trp Leu Asp Phe Val Thr Tyr Leu His His His Gly His Glu Asp Lys 
275 280 285 

CTC CCT TGG TAC CGT GGG AAG GAA TGG AGT TAC TTG AGA GGA GGA CTT 912 
Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu 
290 295 300 

ACA AGA TTG GAC CGG GAC TAC GGA TTG ATC AAC AAC ATC CAT CAC GAC 960 
Thr Thr Leu Asp Arg Asp Tyr Gly Leu lie Asn Asn He His His Asp 
305 310 315 320 

ATT GGA ACT CAT GTG ATA CAT CAT CTT TTC CCT CAG ATC CCA CAT TAT 1008 
lie Gly Thr His Val He His His Leu Phe Pro Gin He Pro His Tyr 
325 330 335 

CAT CTA GTA GAA GCA ACA GAA GCA GCT AAA CCA GTA TTA GGG AAG TAT .1056 
His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Leu Gly Lys Tyr 
340 345 350 

TAT AGG GAG CCT GAT AAG TCT GGA CCT TTG CCA TTA CAT TTA CTG GGA 1104 
Tyr Arg Glu Pro Asp Lys Ser Gly Pro Leu Pro Leu His Leu Leu Gly 
355 360 365 

ATC TTA GCA AAA AGT ATT AAA GAA GAT CAT TTT GTG AGC GAT GAA GGA 1152 
lie Leu Ala Lys Ser He Lys Glu Asp His Phe Val Ser Asp Glu Gly 
370 375 380 

GAT GTT CTA TAC TAT GAA GCA GAC CCT AAT CTC TAT GGA GAG ATC AAG 1200 
Asp Val Val Tyr Tyr Glu Ala Asp Pro Asn Leu Tyr Gly Glu He Lys 
385 390 395 400 

GTA ACA GCA GAG TGAAATGAAG CTGTCAGATT TATCTATTTC TGACCAGCTG 1252 
Val Thr Ala Glu 

405 

ATTTTTTTTG CTTATTAATG TCAATTCATT GTGTTACCAT TATCTCTGAA TACAATCAGA 1312 

TGGAAACCCC AACTTTGTTT TCAATACTTG AAGCTATATA TATATATATA TATGTAAGAT 1372 

ACATTGTATT GTCATTAGAT TCACCATTCT CAAGGTTCTT ATACAAAAAA AAAAAAA 1429 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Phe Lys Phe Arg Gin Ser Pro Ser Ser Pro Arg Phe Arg Leu Asn Ser 
1 5 10 15 

Arg Asn Trp Ala Leu Asn Val Thr thr Pro Leu Thr Val Asp Ser Ser 
20 25 30 

Ser Ser Pro Pro lie Glu Glu Glu Pro Lys Thr Gin Arg Phe Asp Pro 
35 40 45 

Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp He Arg Ala Ala He Pro 
50 55 60 

Lys His Cys Trp Val Lys Asn Pro Trp Lys Ser Met Ser Tyr Val Val 
65 70 75 80 

Arg Glu Leu Ala He Val Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu 
85 90 95 

Asn Asn Trp Leu Val Trp Pro Leu Tyr Trp He Ala Gin Gly Thr Met 
100 105 HO 

Phe Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe 
115 120 125 

Ser Asn Asp Pro Arg Leu Asn Ser Val Val Gly His Leu Leu His Ser 
130 135 140 

Ser He Leu Val Pro Tyr His Gly Trp Arg lie Ser His Arg Thr His 
145 150 155 160 

His Gin Asn His Gly His Val Glu Asn Asp Glu Ser Trp His Pro Met 
165 170 175 

Ser Glu Lys lie Tyr Lys Ser Leu Asp Lys Pro Thr Arg Phe Phe Arg 
180 165 190 

Phe Thr Leu Pro Leu Val Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala 
195 200 205 

Arg Ser Pro Gly Lys. Lys Gly Ser His Tyr His Pro Asp Ser Asp Leu 
210 215 220 

Phe Leu Pro Lys Glu Arg Asn Asp Val Leu Thr Ser Thr Ala Cys Trp 
225 230 235 240 



WO 93/1 1245 PCT/LS92/10284 

143 



Thr Ala Met Ala Val Leu Leu Val Cys Leu Asn Phe Val Met Gly Pro 
245 250 255 

Met Gin Met Leu Lys Leu Tyr Val He Pro Tyr Trp He Asn Val Met 
260 265 270 

Trp Leu Asp Phe Val Thr Tyr Leu His His His Gly His Glu Asp Lys 
275 280 285 

Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu 
290 295 300 

Thr Thr Leu Asp Arg Asp Tyr Gly Leu lie Asn Asn lie His His Asp 
305 310 315 320 

lie Gly Thr His Val lie His His Leu Phe Pro Gin He Pro His Tyr 
325 330 335 

His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Leu Gly Lys Tyr 
340 345 350 

Tyr Arg Glu Pro Asp Lys Ser Gly Pro Leu Pro Leu His Leu Leu Gly 
355 360 365 

He Leu Ala Lys Ser He Lys Glu Asp His Phe Val Ser Asp Glu Gly 
370 375 380 

Asp Val Val Tyr Tyr Glu Ala Asp Pro Asn Leu Tyr Gly Glu He Lys 
385 390 395 400 

Val Thr Ala Glu 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(vi) ORIGINAL SOURCE; 

(A) ORGANISM: Glycine max 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: pXFl 
<ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 855.. 1997 




(aci) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 




ACAATAATAA ATCCATATTT TTATAATTAA AAGTAGTAGA TTACAGC6AT GCACTTGAGA 


60 


AACATATTAA GTGGACTAAT TCTCCCTGGT CAAGCAAGAA AAAAACCA6C TATGACCCAA 


120 


GGTAGAGAGA GATTATACAC AGAATACTAG TAATTAACTA AGACTGGCTC TGCAATTGCC 


180 


AAAAACTCCA TTGCAGTAGC AGCCACCTGA GAAGACACTA A6ACCTAGAC TAGACCATAC 


240 


ATATGAAGAT TAATAGGCTT ACATAACAAC ATAGGACACT AAGAAAACAC GGCTTACAGA 


300 


GAATCCAGCT GACTCTATAA GAGGGGTACT TCTGGAGATT AAAATTATCC GAATCACCTT 


3*0 


CCCACTGCGG CTGCTGACGT CAGCGAAAGT CAGAAGCGAA AGCGGCGAAG AACCTTCAGA 


420 


AGAGGAGGAA GCAGTTCGAC CTTACAAGAG TTGTTGTCGT TGTTGTTGTC GTTCTCTGGC 


480 


GGAGAAGCGA GTTTGGATCG CGTTTTCCTC GGAGGCTTCT CGGTCTTCCC CTGTTTCTGC 


540 


AGCTCAGCCA GGCCCTCGCA AATGGCCTGA AGCTTGGCGT CAACGGCGGA ATGAAGAGGC 


600 


TAATACTCCC CGAAGTCACC AGCGACGGAG GAACCCTGGT GTCGGAGGTT GGGGAAGTTG 


660 


AGCCTGGCGA AGTCACCTCG GAGCTTGTAC GCGGCCTTGT GGTACGCCAG AGCGGCTTCC 


720 


TCGGCGGTGT CGAAGGTTCC CAGCCATAGC GTGGTCCGGA TTCTTCGGGA GTCTAAXCTC 


780 


AGCCACCCAC TTCCCCCCTG AGAAAAGAGA GGAACCACAC TCTCTAAGCC AAAGCAAAAG 


840 


CAGCAGCAGC AGCA ATG GTT AAA GAC ACA AAG CCT TTA GCC TAT GCT GCC 


890 


Met Val I*ys Asp Thr Lys Pro Leu Ala Tyr Ala Ala 




1 5 10 





AAT AAT GGA TAC CAA CAA AAG GGT TCT TCT TTT GAT TTT GAT CCT AGC 938 
Aisn Asn Gly Tyr Gin Gin Lys Gly Ser Ser Phe Asp Phe Asp Pro Ser 
15 20 25 

GCT CCT CCA CCG TTT AAG ATT GGA GAA ATC AGA GCT TCA ATA CCA AAA 986 
Ala Pro Pro Pro Phe Lys He Ala Glu He Arg Ala Ser He Pro Lys 
30 35 40 

CAT TGC TGG GTC AAG AAT CCA TGG AGA TCC CTC AGT TAT GTT CTC AGG 1034 
His Cys Trp Val Lys Asn Pro Trp Arg Ser Leu Ser Tyr Val Leu Arg 
45 50 55 60 

GAT GTG CTT GTA ATT GCT GCA TTG GTG GCT GGA GCA hVT CAC TTC GAC 1082 
Asp Val Leu Val He Ala Ala Leu Val Ala Ala Ala He His Phe Asp 
65 ' 70 75 

AAC TGG CTT CTC TGG CTA ATC TAT TGC CCC ATT CAA GGC ACA ATG TTC 1130 
Asn Trp Leu Leu Trp Leu He Tyr Cys Pro He Gin Gly Thr Met Phe 
80 85 90 
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TGG GCT CTC TTT GTT CTT GGA CAT GAT TGT GGC CAT GGA AGC TTT TCA 1178 
Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe Ser 
95 100 105 

GAT AGC CCT TTG CTG AAT AGC CTG GTG GGA CAC ATC TTG CAT TCC TCA 1226 
Asp Ser Pro Leu Leu Asn Ser Leu Val Gly His lie Leu His Ser Ser 
110 H5 120 

ATT CTT GTG CCA TAG CAT GGA TGG AGA ATT AGC CAC AGA ACT CAC CAT 1274 
lie Leu Val Pro Tyr His Gly Trp Arg He Ser His Arg Thr His His 
125 130 135 140 

CAA AAC CAT GGA CAC ATT GAG AAG GAT GAG- TCA TGG GTT CCA TTA ACA 1322 
Gin Asn His Gly His He Glu Lys Asp Glu Ser Trp Val Pro Leu Thr 
145 150 155 

GAG AAG ATT TAC AAG AAT CTA GAC AGC ATG ACA AGA CTC ATT AGA TTC 1370 
Glu Lys He Tyr Lys Asn Leu Asp Ser Met Thr Arg Leu lie Arg Phe 
160 165 170 

ACT GTG CCA TTT CCA TTG TTT GTG TAT CCA ATT TAT TTG TTT TCA AGA 1418 
Thr Val Pro Phe Pro Leu Phe Val Tyr Pro lie Tyr Leu Phe Ser Arg 
175 180 185 

AGC CCC GGA AAG GAA GGC TCT CAC TTC AAT CCC TAC AGC AAT CTG TTC 1466 
Ser Pro Gly Lys Glu Gly Ser His Phe Asn Pro Tyr Ser Asn Leu Phe 
190 195 200 

CCA CCC AGT GAG AGA AAA GGA ATA GCA ATA TCA ACA CTG TGT TGG GCT 1514 
Pro Pro Ser Glu Arg Lys Gly He Ala He Ser Thr Leu Cys Trp Ala 
205 210 215 220 

ACC ATG TTT TCT CTG CTT ATC TAT CTC TCA TTC ATA ACT AGT CCA CTT 1562 
Thr Met Phe Ser Leu Leu He Tyr Leu Ser Phe lie Thr Ser Pro Leu 
225 230 235 

CTA GTG CTC AAG CTC TAT GGA ATT CCA TAT TGG ATA TTT GTT ATG TGG 1610 
Leu Val Leu Lys Leu Tyr Gly . He Pro Tyr Trp He Phe Val Met Trp 
240 245 250 

CTG GAC TTT GTC ACA TAC TTG CAT CAC CAT GGT CAC CAC CAG AAA CTG 1658 
Leu Asp Phe Val Thr Tyr Leu His His His Gly His His Gin Lys Leu 
255 260 265 

CCT TGG TAC CGC GGC AAG GAA TGG AGT TAT TTA AGA GGT GGC CTC ACC 1706 
Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr 
270 275 280 

ACT GTG. GAT CGT GAC TAT GGT TGG ATC TAT AAC ATT CAC CAT GAC ATT 1754 
Thr Val Asp Arg Asp Tyr Gly Trp He Tyr Asn He His His Asp He 
285 290 ' 295 300 

GGC ACC CAT GTT ATC CAC CAT CTT TTC CCC CAA ATT CCT CAT TAT CAC 1802 
Gly Thr His Val He His His Leu Phe Pro Gin He Pro His Tyr His 
305 310 315 
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CTC GTT GAA GCG ACA CAA GCA GCA AAA CCA GTT CTT GGA GAT TAG TAC 1850 
Leu Val Glu Ala Thr Gin Ala Ala Lys Pro Val Leu Gly Asp Tyr Tyr 
320 325 330 

CGT GAG CCA GAA AGA TCT GCG CCA TTA CCA TTT CAT CTA ATA AAG TAT 1098 
Arg Glu Pro Glu Arg Ser Ala Pro Leu Pro Phe His Leu lie Lys Tyr 
335 340 345 

TTA ATT CAG AGT ATG AGA CAA GAC CAC TTC GTA AGT GAC ACT GGA GAT 1946 
Leu He Gin Ser Met Arg Gin Asp His Phe Val Ser Asp Thr Gly Asp 
350 355 360 

GTT GTT TAT TAT GAG ACT GAT TCT CTG CTC CTC CAC TCG CAA CGA GAC 1994 
Val Val Tyr Tyr Gin Thr Asp Ser Leu Leu Leu His Ser Gin Arg Asp 
365 370 375 380 

TGAGTTTCAA ACTTTTTGGG TTATTATTTA TTGGATTCTA GCTACTCAAA TTACTTTTTT 2054 

TTTAATGTTA TGTTTTTTGG AGTTTAACGT TTTCTGAACA ACTTGCAAAT TACTTGCATA 2114 

GAGAGACATG GAATATTTAT TTGAAATTAG TAAGGTAGTA ATAATAAATT TTGAATTGTC 2174 

AGTTTCA 

(2) INFORMATION FOR SEQ ID NO; 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii ) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Val Lys Asp Thr Lys Pro Leu Ala Tyr Ala Ala Asn Asn Gly Tyr 
15 10 15 

Gin Gin Lys Gly Ser Ser Phe Asp Phe Asp Pro Ser Ala Pro Pro Pro 
20 25 30 

Phe Lys lie Ala Glu lie Arg Ala Ser Xle Pro Lys His Cys Trp Val 
35 40 45 

Lys Asn Pro Trp Arg Ser Leu Ser Tyr Val Leu Arg Asp Val Leu Val 
50 55 60 

lie Ala Ala Leu Val Ala Ala Ala lie His Phe Asp Asn Trp Leu Leu 
65 70 " 75 80 

Trp Leu lie Tyr Cys Pro He Gin Gly Thr Met Phe Trp Ala Leu Phe 
85 90 95 

Val Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asp Ser Pro Leu 
100 105 HO 
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Leu Asn Ser Leu Va'l Glv Hie tu r ~ ' ' ■ 

US HaS Leu HiS Ser Ser lie Leu Val Pro 



120 



125 



Tyr His Gly Trp Arg lie Ser His Arg Thr His ? i s Gln Asn ^ ^ 



140 



Hi, lie G l« Lys Asp g ser Tsp val pro ^ ^ ne ^ 



155 



160 

Lys Asn Leu Asp Ser Met Thr Arg Leu lie Arg Phe Thr Val Pro Phe 



175 



Pro Leu Phe Val Tyr Pro lie Tyr Leu Phe Ser Arg Ser Pro Gly Lys 

185 190 
Giu Gly ser His Phe Asn Pro Tyr Ser Asn Leu Phe Pro Pro Ser Glu 

<*U0 205 

Arg Lys Gly lie Ala lie Ser Thr Leu Cya trp Ala Thr Met Phe Ser 

220 

Leu leu He Tyr Leu Ser Phe He Thr Ser Pro Leu Lea Val Leu Lys 

235 240 
L« Tyr Gly lie Pro Tyr Trp He Phe Val Met Trp Leu Asp Phe Val 

250 255 
Thr Tyr Leu His His His Gl y His His Gl„ Ly3 ^ Pro Trp ^ 

255 270 
Gly Lys Glu Trp Ser Tyr Leu Arg Gl y Gly Leu Thr Thr Val Asp Arg 

285 

Asp Tyr Gly Trp He Tyr Asn lie His His Asp Ue Gly Thr His Val 

295 300 
lie His His Leu Phe Pro Gin He Pro Hi s Tyr His Leu Val Glu Ala 



315 



320 



Thr Gin Ala Ala Lys Pro Val Leu Gly Asp Tyr Tyr Arg Glu Pro Glu 



335 



Arg Ser Ala Pro Leu Pro Phe His Leu He Lys Tyr Leu He Gin Ser 



345 



350 



Met Arg Gin Asp His Phe Val Ser Asp Thr Gly Asp Val Val Tyr Tyr 

360 365 
Gin Thr. Asp Ser Leu Leu Leu His Ser Gin Arg Asp 



375 



380 



(2) INFORMATION FOR SEQ ID NO; 12; 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH; 1675 base pairs 
(B) TYPE; nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
{ vi) ORIGINAL SOURCE : 

(A) ORGANISM: Glycine max 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: pSFD-116bwp 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 169.. 1530 



(xi) SEQUENCE DESCRIPTION: SEQ ID Np:12r 
CTGTGGCAAT TTTTCTCTTC TCCTTCTGGT TCTCATCTTT ' GTGTTCTTCT TTGTTTCTCA 
CCTTTCTGAG GATTTTTCCA TCTTAGTTCC TGGAGGCACC AGGAACCTGA CCAAATAAAT 
AAACCTTTTT TTTCTTCTAA TTTTTCTGAA GTTTCATTTT TTAGTCCA ATG GCA ACT 

1 

TGG TAT CAT CAG AAA tGT GGC TTG AAG CCT CTT GCT CCA GTA ATT CCT 
Tro .lyt His Gin Lys Cys Gly Leu Lys Pro Leu Ala Pro Val lie Pro 
5 10 15 

AGA CCT AGA ACT GGG GCT GCT TTG TCC AGO ACC TCA AGG GTT GAA TTT 
Arg Pro Arg Thr Gly Ala Ala Leu Ser Ser Thr Ser Arg Val Glu Phe 
20 25 ?0 

TTG GAC ACA AAC AAG GTA GTG GCA GGT CCT AAG TTT CAA CCT TTG AGG 
Leu 2p t"£ £n Lys Val Val Ala Gly Pro Lys Phe Gin Pro Leu Arg 
40 45 3U 

TGC AAC CTC AGG GAG AGG AAT TGG GGG CTG AAA GTG AGT GCC CCT TTG 
£J A^n 2u Glu Arg Asn Trp Gly Leu Lys Val Ser Ala Pro Leu 
55 6° 65 

AGG GTT GCT TCC ATT GAA GAG GAG CAA AAG AGT GTT GAT TTA ACC AAT 
S vll Ala 5r He Glu Glu Glu Gin Lys Ser Val Asp Leu Thr Asn 
70 75 W 

GGG ACT AAT GGG GTT GAG CAT GAG AAG CTT CCA GAA TTT GAC CCT GGT 
Gly Thr Asn Gly val Glu His Glu Lys Leu Pro Glu Phe Asp Pro Gly 
85 9° 

GCT CCG CCA CCA TTC AAC TTG GCT GAT ATT AGA GCA GCC ATT CCA AAG 
Ala Pro Pro Pro Phe Asn Leu Ala Asp He Arg Ala Ala He Pro Lys 
100 105 HO 115 



60 
120 
17-7 

225 

273 

321 

369 

417 

465 

513 
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CAT TGC TGG GTG AAG GAC CCT TGG AGG TCC ATG AGC TAT GTG GTG AGG 561 
His Cys Trp Val Lys Asp Pro Trp Arg Ser Met Ser Tyr Val Val Arg 
120 125 130 

GAT GTG ATT OCT GTC TTT GGT TTG GCT GCT GCT GCT GCG TAT CTC AAT 609 
Asp Val lie Ala Val Phe Gly Leu Ala Ala Ala Ala Ala Tyr Leu Asn 
135 140 145 

AAT TGG TTG GTT TGG CCT CTC TAT TGG GCT GCT CAA GGC ACT ATG TTC 657 
Asn Trp Leu Val Trp Pro Leu Tyr Trp Ala Ala Gin Gly Thr Met Phe 
150 155 160 

TGG GCT CTG TTT GTT CTT GGT CAT GAT TGT GGT CAT GGA AGC TTT TCA 705 
Trp Ala Leu Phe Val Leu Gly Mis Asp Cys Gly His Gly Ser Phe Ser 
165 170 175 

AAC AAC TCC AAA TTG AAC AGT GTT {JTT GGA CAT CTG CTG CAT TCT TCA 753 
Asn Asn Ser Lys Leu Asn Ser Val Val Gly His Leu Leu His Ser Ser 
180 185 190 195 

ATT CTA GTG CCA TAT CAT GGA TGG AGA ATC AGT CAT AGG ACT CAT CAC 801 
lie Leu Val Pro Tyr His Gly Trp Arg lie Ser His Arg Thr His His 
200 205 210 

CAA CAT GAT GGT CAT GCT GAA AAT GAT GAA TCA TGG CAT CCG TTG CCT 849 
Gin His His Gly His Ala Glu Asn Asp Glu Ser Trp His Pro Leu Pro 
215 220 225 

GAA AAA TTG TTC AGA AGC TTG GAC ACT GTA ACT CGT ATG TTA AGA TTC 897 
Glu Lys Leu Phe Arg Ser Leu Asp Thr Val Thr Arg Met Leu Arg Phe 
230 235 240 

ACA GCA CCT TTT CCA CTT CTT GCA TTT CCT GTG TAC CTT TTT AGT AGG 945 
Thr Ala Pro Phe Pro Leu Leu Ala Phe Pro Val Tyr Leu Phe Ser Arg 
245 250 255 

AGT CCT GGG AAG ACT GGT TCT CAC TTT GAC CCC AGC AGT GAC TTG TTC 993 
Ser Pro Gly Lys Thr Gly Ser His Phe Asp Pro Ser Ser Asp Leu Phe 
260 265 270 275 

GTT CCC AAT GAA AGA AAA GAT GTT ATT ACT TCC ACA GCT TGT TGG GCT 1041 
Val Pro Asn Glu Arg Lys Asp Val lie Thr Ser Thr Ala Cys Trp Ala 
280 285 290 

GCT ATG TTG GGA TTG CTT GTT GGA TTG GGG TTT GTA ATG GGT CCA ATT 1089 
Ala Met Leu Gly Leu Leu Val Gly Leu Gly Phe Val Met Gly Pro lie 
295 300 305 

CAA CTT- CTT AAG CTT TAT GGT GTT CCC TAT GTT ATA TTC GTT ATG TGG 1137 
Gin Leu Leu Lys Leu Tyr Gly Val Pro Tyr Val He Phe Val Met Trp 
310 315 320 



TTG GAT TTG GTG ACT TAT TTG CAC CAT CAT GGC CAT GAA GAC AAA TTA 
Leu Asp Leu Val Thr Tyr Leu His His His Gly His Glu Asp Lys Leu 
325 330 335 



1185 



WO 93/11245 



PCT/LS92/ 10284 



150 



CCT TGG TAC CGT GGA AAG GAA TGG AGC TAC CTC AGG GGT GGT CTA ACT 
Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr 
340 345 350 355 

ACT CTT GAT CGT GAT TAT GGA TGG ATC AAT AAC ATT CAC CAT GAC ATT 
Thr Leu Asp Arg Asp Tyr Gly Trp lie Asn Asn tie His His Asp lie 
360 / 365 370 

GGC ACT CAT GTC ATT CAT CAC CTA TTT CCT CAA ATT CCA CAC TAT CAC 
Gly Thr His Val lie His His Leu Phe Pro Gin lie Pro His Tyr His 
375 380 ( 385 

TTA GTT GAG GCT ACT GAG GCT GGT AAG CCA GTG TTT GGA AAA TAT TAT 
Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Phe Gly Lys Tyr Tyr 
390 395 400 

AGA GAA CCA AAG AAA TCA GCA GCA CCT CTT CCT TTT CAC CTT ATT GGG 
Arg Glu Pro Lys Lys Ser Ala Ala Pro Leu Pro Phe His Leu lie Gly 
405 410 415 

GAA ATA ATA AGG AGC TTC AAG ACT GAC CAT TTT GTT AGT GAC ACG GGG 
Glu lie lie Arg Ser Phe Lys Thr Asp His Phe Val Ser Asp Thr Gly 
420 425 430 435 

GAT GTT GTG TAC TAT CAA ACC GAC TCT AAG ATT AAT GGC TCT TCC AAA 
Asp Val Val Tyr Tyr Gin Thr Asp Ser Lys lie Asn Gly Ser Ser Lys 
440 445 450 

TTA GAG TGAATATTAA AATTCTTTTC TATATAGACA AGAGAGGCTT ATACACAATT 
Leu Glu 

CTTATTGCTT TAAAGATTGT CTTGAGTTTC TCCGAAAGTT ACTGCACTTA CTTGGAGTTG 
AATCCTTCAT TAATAAAGGG ATGGATGGAT CATATAAA 



1233 
1281 
1329 
1377 
1425 
1473 
1521 

1577 

1637 
1675 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION : SEQ ID NO: 13: 

Met Ala Thr Trp Tyr His Gin Lys Cys Gly Leu Lys Pro Leu Ala Pro 
1 5 10 " 

Val lie Pro Arg Pro Arg Thr Gly Ala Ala Leu Ser Ser Thr Ser Arg 
20 25 30 

Val Glu Phe Leu Asp Thr Asn Lys Val Val Ala Gly Pro Lys Phe Gin 
35 40 45 
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Pro Leu Arg Cys Asn Leu Arg Glu Arg Asn Trp Gly Leu Lys Val Ser 
50 55 60 

Ala Pro Leu Arg Val Ala Ser lie Glu Glu Glu Gin Lys Ser Val Asp 
65 70 75 80 

Leu Thr Asn Gly Thr Asn Gly Val Glu His Glu Lys Leu Pro Glu Phe 
65 90 95 

Asp Pro Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp lie Arg Ala Ala 
100 105 110 

lie Pro Lys His Cys Trp Val Lys Asp Pro Trp Arg Ser Met Ser Tyr 
115 120 125 

Val Val Arg Asp Val lie Ala Val Phe Gly Leu Ala Ala Ala Ala Ala 
130 135 140 

Tyr Leu Asn Ash Trp Leu Val Trp Pro Leu Tyr Trp Ala Ala Glri Gly 
145 150 155 160 

Thr Met Phe Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His Gly 
165 170 175 

Ser Phe Ser Asn Asn Ser Lys Leu Asn Ser Val Val Gly His Leu Leu 
180 185 190 

His Ser Ser He Leu Val Pro Tyr His Gly Trp Arg lie Ser His Arg 
195 200 205 

Thr His His Gin His His Gly His Ala Glu Asn Asp Glu Ser Trp His 
210 215 220 

Pro Leu Pro Glu Lys Leu Phe Arg Ser Leu Asp Thr Val Thr Arg Met 
225 230 235 240 

Leu Arg Phe Thr Ala Pro Phe Pro Leu Leu Ala Phe Pro Val Tyr Leu 
245 250 255 

Phe Ser Arg Ser Pro Gly Lys Thr Gly Ser His Phe Asp Pro Ser Ser 
260 265 270 

Asp Leu Phe Val Pro Asn Glu Arg Lys Asp Val He Thr Ser Thr Ala 
275 ' 280 285 

Cys Trp Ala Ala Met Leu Gly Leu Leu Val Gly Leu Gly Phe Val Met 
290 295 300 

Gly Pro lie Gin Leu Leu Lys Leu Tyr Gly Val Pro Tyr Val He Phe 
305 310 . 315 320 

Val Met Trp Leu Asp Leu Val Thr Tyr Leu His His His Gly His Glu 
325 330 335 

Asp Lys Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly 
340 345 350 
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Gly Leu Thr Thr Leu Asp Arg Asp Tyr Gly Trp lie Asn Asn He His 
355 360 365 

His Asp He Gly Thr His Val He His His Leu Phe Pro Gin He Pro 
370 375 380 

His Tyr His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Phe Gly 
385 390 395 400 

Lvs Tyx Tyr Arg Glu Pro Lys Lys Ser Ala Ala Pro Leu Pro Phe His 
405 410 415 

Leu He Gly Glu lie He Arg Ser Phe Lys Thr Asp His Phe Val Ser 
420 425 430 

Asp Thr Gly Asp Val Val Tyr Tyr Gin Thr Asp Ser Lys He Asn Gly 
435 440 445 

Ser Ser Lys Leu Glu 
450 

(2) INFORMATION FOR SEQ ID NO: 14; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(iiiy HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: pPCR20 
(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION; 31 ,.363 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 14: 

GGATCCACGC ATCATCAGAA TCACGGTCAC ATCCACAGGG ACGAGTCATG GCACCCGATC 60 

ACGGAGAAGC TGTACCGGCA ACTAGAGCCA CGCACCAAGA AGCTGAGATT CACGGTGCCC 120 

TTCCCCCTGC TCGCATTCCC CGTCTACCTC TTGTACAGGA GCCCCGGCAA GCTCGGCTCC 180 

CACTTCCTTC CCAGCAGCGA CCTGTTCAGC CCCAAGGAGA AGAGCGACGT CATGGTGTCA 240 
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ACCACCTGCT GGTGCATCAT GCTCGCCTCC CTCCTCGCCA TGGCGTGCGC GTTCGGCCCA 300 
CTCCAGGTGC TCAAGATGTA CGGCATCCCA TACCTGGTGT TCGTGATGTG GCTTGACCTG 360 
GTGACGTACT TAGATCACCA COTCCACGAT GGATGC 396 

(2) INFORMATION FOR SEQ ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : unknown 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 

(v) FRAGMENT TYPE: internal 
r (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: pPCR20 

<xi> SEQUENCE DESCRIPTION: SEQ ID N0:15: 

His His Gin Asn His Gly His He His Arg Asp Glu Ser Trp His Pro 
1 5 10 15 

He Thr Glu Lys Leu Tyr Arg Gin Leu Glu Pro Arg Thr Lys Lys Leu 
20 25 30 

Arg Phe Thr Val Pro Phe Pro Leu Leu Ala Phe Pro Val Tyr Leu Leu 
35 40 45 

Tyr Arg Ser Pro Gly Lys Leu Gly Set His Phe Leu Pro Ser Ser Asp 
50 55 60 

Leu Phe Ser Pro Lys Glu Lys Ser Asp Val Met val Ser Thr Thr Cys 
65 70 75 80 

Trp Cys He Met Leu Ala Ser Leu Leu Ala Met Ala Cys Ala Phe Gly 
85 90 95 

Pro Leu Gin Val Leu Lys Met Tyr Gly He Pro Tyr Leu Val Phe Val 
100 105 HO 

Met Trp Leu Asp Leu Val Thr Tyr Leu His His His Gly His 
115 120 12S 
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(2) INFORMATION FOR SEQ ID NO: 16: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic, acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii ) HYPOTHETICAL : NO 
(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Arabidopsis thaliana 
(vii) IMMEDIATE SOORCE: 

(B) CLONE: pFadx-2 and pYacp7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CCTCGAGCTA CGTCAGGGCT AAAACCAGGA ACTGGGCATT GAATGTGGCA ACACCTTTAA 60 
CAACTCTTCA GTCTCCATCC GAGGAAGACA GGGAGAGATT CGACCCAGGT GCGCCTCCTC 120 
CCTTCAATTT GGCGGATATA AGAGCAGCCA TACCTAAGCA TTGTTGGGTT AAGAATCCAT 180 
GGATGTCTAT GAGTTATGTT GTCAGAGATG TTGCTATCGT CTTTGGATTG GCTGCTGTTG 240 
CTdCTTACTT CAACAATTGG CTTCTCTGGC CTCTCTACTG GTTCGCTCAA GGAACCATGT 
TCTGGGCTCT CTTTGTCCTT GGCCATGACT GCGGACATGG TAGCTTCTCG AATGATCCGA 
GGCTGAACAG TGTGGCTGGT CATCTTCTTC ATTCCTCAAT CCTGGTCCCT TACCATGGCT 
GGAGGATTAG CCACAGAACT CACCACCAGA ACCATGGTCA TGTCGAGAAT GA 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

<B) TYPE: amino acid 

(C) STRAND ED NESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 

(v) FRAGMENT TYPE: N-terrainal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 



300 
360 
420 
472 
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PCr/L : S92/l0284 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: pFadx-2 and pYacp7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ser Ser Tyr Val Arg Ala Lys Thr Arg Aan Trp Ala Leu Asn Val Ala 
1 5 10 15 

Thr Pro Leu Thr Thr Leu Gin Ser Pro Ser Glu Glu Asp Arg Glu Arg 
20 25 30 

Phe Asp Pro Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp lie Arg Ala 
35 40 45 

Ala He Pro Lys His Cys Trp Val Lys Asn Pro Trp Met Ser Met Ser 
50 55 60 

Tyr Val Val Arg Asp Val Ala He Val Phe Gly Leu Ala Ala Val Ala 
65 70 75 80 

Ala Tyr Phe Asn Asn Trp Leu Leu Trp Pro Leu Tyr Trp Phe Ala Gin 
85 90 95 

Gly Thr Met Phe Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His 
100 105 110 

Gly Ser Phe Ser Asn Asp Pro Arg Leu Asn Ser Val Ala Gly His Leu 
115 120 125 

Leu His Ser Ser He Leu Val Pro Tyr His Gly Trp Arg He Ser His 
130 135 140 

Arg Thr His His Gin Asn His Gly His Val Glu Asn 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..11 

(D) OTHER INFORMATION: /note= tt N« INOSINE" 
<ix) FEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION: 12.. 31 

(D) OTHER INFORMATION: /note- W N- A OR T OR G OR C" 
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(xi) SEQUENCE DESCRIPTION: SE£f ID NO: 18; 
CGGGATCCAC NCAYCAYCAR AAYCAYGGNC A 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: ) 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /note- "N= INOSINE" 
(ix) FEATURE; 

(A) NAME/KEY; misc feature 

(B) LOCATION: 16 ..35 

(D) OTHER INFORMATION: /note- "N= A OR T OR G OR C 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGGGATCCRT CRTGNCCRTG RTGRTGNARR TANGT 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..36 

(D) OTHER INFORMATION: /note= W N~ INOSINE" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTCGTNNTNG GNCAYGAYTG YGGNCAYGGN CAYGGNAGNT TC 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 
<B> LOCATION: 1..36 

<D) OTHER INFORMATION: /note- W N» INOSINE" 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TTCGTNNTNG GNCAYGAYTG YGGNCAYGGN TCNTTC 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GGHCAYGAYT GYGGHCAC 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GGHCAYGAYT GYGGHCAT 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



GTACTRTARC CDTGDGTR 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
<D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GTGCTRTARC CDTGDGTR 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
GTRCANTARG TRGTRAAYAA YGG 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GTRCANTADG TRGTRGADAA YGG 



(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:. 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 



(A) NAME/KEY: misc feature 

(B) LOCATION: 1- .36 

(D) OTHER INFORMATION: /note~ 



INOSINE" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TTCGTNNTNG GNCAYGAYTG YGGNCAYGGN AGNTTT 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ix) FEATURE: 

(A) NAME /KEY: misc feature 

(B) LOCATION: 1..36 

(D) OTHER INFORMATION: /note- "N- INOSINE" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TTCGTNNTNG GNCAYGAYTG YGGNCAYGGN TCNTTT 

(2) INFORMATION FOR SEQ ID NO: 30: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ux) FEATURE: 

(A) NAME /KEY: misc feature 

(B) LOCATION: 1..38 

(D) OTHER INFORMATION: /note- W N« INOSINE" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GTRCTRTANC CNTGNGTNCA NTANGTAGTG RANAAGGG 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: misc feature 

(B) LOCATION: 1.-38 
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(D) OTHER INFORMATION: /note= "N e INOSINE w 
(aci) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTRCTRTANC CNTGNGTNCA NTANGTGGTG RANAAGGG 38 

(2) INFORMATION FOR SEQ ID NO; 32: 

(i) SEQUENCE CHARACTERISTICS: * 

(A) LENGTH: 138 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

iix) FEATURE: 

(A) NAME /KEY: misc feature 

(B) LOCATION: 1..135 

(D) OTHER INFORMATION: /nOte« "N« INOSINE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GTGGTGNGTN CNGTGNGANA NNCKCCANCC GTGGTANGGN, ACNANNANGA ANGANGAGTG 60 
KANNANGTGN CCNACNANNG AGTTNANNAN NGGNATNTCN GAGAANGANC CGTGNCCGCA 120 

138 

NTCGTGNCCN ANNACGAA 
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CLAIMS 

1. An isolated nucleic acid fragment comprising a 
nucleic acid sequence encoding a fatty acid desaturase 
or a fatty acid desaturase-related enzyme with an amino 

5 acid identity of 50% or greater to the polypeptide 
encoded by SEQ ID NQS : 1 , 4, 6, 8, 1-0, 12, 14 or 16. 

2. The isolated nucleic acid fragment of Claim 1 
wherein the amino acid identity is 65% or greater to the 
polypeptide encoded by SEQ ID N0S:1, 4, 6, 8, 10, 12, 14 

10 or 16. 

3. The isolated nucleic acid fragment of Claim 1 
wherein the nucleic acid identity is 90% or greater to 
SEQ ID NOStl, 4, 6, 8, 10, 12, 14 or 16. 

4. An isolated nucleic acid fragment of Claim 1 
15 wherein said fragment is isolated from a plant selected 

from the group consisting of soybean, oilseed Brass ica 
species, Arahidopsls thaliana and corn. 

5. A chimeric gene capable of causing altered 
levels of linolenic acid in a transformed plant cell, 

20 the gene comprising a nucleic acid fragment of any of 
Claims 1, 2, or 3, the fragment operably linked to 
suitable regulatory sequences. 

6. Plants containing the chimeric genes of 
Claim 5. 

25 7. Oil obtained from seeds of the plants 

containing the chimeric genes of Claim 5 . 

8, A method of producing seed oil containing 
altered levels of linolenic (18:3) acid comprising: 

(a) transforming a plant cell of an oil- 
30 producing species with a chimeric gene of Claim 5; 

(b) growing fertile plants from the 
transformed plant cells of step (a) ; 

(c) screening progeny seeds from the fertile 
plants of step (b) for the desired levels of linolenic 

35 (18:3) acid; and 
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(d) processing the progeny seed of step (c) 
to obtain seed oil containing altered levels of 
linolenic (18:3) acid. 

9 The product of the method of Claim 8, 
5 10. A method of Claim 8 wherein said plant cell of 

1 an oil-producing species is selected from the group 
consisting of ft^Mdopsis thflliana, soybean, oilseed 
RT-assica species, sunflower, cotton, cocoa, peanut, 
saf flower, and corn, 
10 xi* A method of breeding plant species producing 

altered levels of linolenic acid in the seed pil of oil- 
producing plant species comprising: 

(a) making a cross between two varieties of 
oil-producing species differing in the linolenic acid 

15 trait; 

(b) making a Southern blot of restriction 
enzyme digested genomic DNA isolated from several 
progeny plants resulting from the cross of step (a); and 

(c) hybridizing the Southern blot with a 
20 radiolabeled nucleic acid fragment of Claim 1. 

12. The product of the method of Claim 11. 

13. A method of RFLP mapping in a genomic RFLP 

marker comprising: 

(a) making a cross between two varieties of 

25 plants; 

(b) making a Southern blot of restriction 
enzyme digested genomic DNA isolated from several 
progeny plants resulting from the cross of step (a) ; and 

(c) hybridizing the Southern blot with a 
30 radiolabeled nucleic acid fragments of Claim 1. 

14. A method to isolate, nucleic acid fragments 
encoding fatty acid desaturases and fatty acid 
desaturase-related enzymes, comprising: 
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(a) comparing SSQ ID N0S:2, 5, 7, 9, 11, 13, 
15 and 17 with other fatty acid desaturase polypeptide 
sequences; 

(b) identifying the conserved sequence is) of 
5 4 or more amino acids obtained in step (a); 

(c) making region-specific nucleotide 

probe (s) or oligomer (s) based on the conserved sequences 
identified in step b; and 

(d) using the nucleotide probe (s) or 

10 oligomers (s) of step c to isolate sequences encoding 

fatty acid desaturases and fatty-acid desaturase-related 
enzymes by sequence-dependent protocols - 

15. The product of the method of Claim 14 ♦ 

16. The isolated genomic DNA of Arabidopsis 
15 t ha liana identified by accession number ATCC 75167. 

17. An isolated cDNA clone which encodes for 
soybean delta-15 desaturase , the clone designated pXFl 
comprising the DNA sequence of SEQ ID NO 10 and 
identified by accession number ATCG 68874 . 

20 18. An isolated cDNA clone which encodes for 

oilseed ttrassica species delta-15 desaturase, the clone 
designated pBNSF3 comprising the DNA sequence of SEQ ID 
NO: 6 and identified by accession number ATCC 68854. 

19. An isolated Polymerase Chain Reaction Product 

25 for Zea mays delta-^15 desaturase, the clone designated 
pcr20 comprising the DNA sequence of SEQ ID NO: 14. 
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Inoculated root disks were incubated at 25°C in the 
dark in petri dishes sealed with Parafilm. After two 
weeks of co-cultivation of carrot disks with 
Agrrtbact e r inm . the carrot disks were transferred to 
5 fresh agar-solidif ied water medium containing 500 mg/L 
carbenicillin for the counterseiection of Aarobacterium. 
At this time, hairy root formation was noted on some 
root disks. Transfer of the explants to fresh 
counterseiection medium was done at four weeks. 

10 Excision of individual roots from the explants was begun 
at six weeks. Ten days later, additional roots were 
taken from the explants as needed. 

Approximately 5-10 mm long hairy roots were excised 
and individually subcultured on MS minimal organics 

15 medium with 30 g/L sucrose (Gibco, Grand Island, N. Y., 
Cat, No. 510-1118EA) and 500 mg/L carbenicillin. 
Approximately equal numbers of roots were subcultured in 
liquid medium and in a medium solidified with 0.6% 
agarose. Cultures on solid medium were grown in 60 x 

20 100 mm petri dishes, liquid cultures were in 6-well 

culture dishes. When excising roots, an effort was made 
to select single roots from distinct callus-like 
outgrowths on the wounded surface. These sites of 
excision were marked on the lid of the petri dish to 

25 minimize repeat sampling of tissue originating from the 
same transformation event. 

Two to three weeks after excision from the 
explants, individual hairy root cultures that were not 
visibly contaminated with Agrrohagterium were transferred 

30 to -fresh MS medium supplemented with 500 mg/L 

carbenicillin. The root -mass of each culture was cut 
into segments including one or more branch roots, and 
these segments were transferred as a group to a plate or 
well of fresh medium. Approximately 20 mg fresh weighs 

35 of tissue of root cultures which grew to adequate size 
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within the next two to three weeks were sampled for 
fatty acid composition by gas chromatography of the 
fatty acyl methyl esters essentially as described by 
Browse et al., (Anal. Biochem. (1986) 152:141-145) 
5 except that 2,5% H2SO4 in methanol was used as the 

methylation reagent and samples were heated for 1.5 h at 
80°G to effect the methaholysis of the seed 
triglycerides. The results are shown in Table 6. A 
second sample of tissue consisting of an actively 

10 growing root tip of approximately 1 cm was excised and 
placed on MS medium supplemented with 500 rog/L 
carbenicillin and 25-50 mg/L kanamycin to test for 
kanamycin resistance select for hairy roots co- 
transformed with the binary vector [Simpson et al. 

15 (1986) Plant Mol. Biol. 6:403-415] . 
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TABLE 6 

Percent 18:3 and 18:2/18:3 Ratio in 
Roots of Transgenic Carrots 



kqql sainpjLfi 


Transformation 


tin » J 


118 :Z/lcUJ 


1 


•% ^ AAA * ' - * * * ^ * 

Rl000/pAW31 


62 


0,09 


2 


R1000/pAW31 


8 


7.30 


3 


R1000/pAW31 


10 


5.69 


4 


Rl000/pAW3l 


62 


0 . 06 


5 


R1000/pAW31 


10 


5.07 


6 


R1000/pAW31 


4 


14.2 


7 


R1000/pAW31 


61 


0.18 


8 


R1000/pAW31 


4 


15.1 


9 


R1000/DAW31 


61 


0.07 


10 


R1000/pAW31 


63 


0.09 


11 


R1000/pAW31 


15 


3.04 


12 


R1000/pAW31 


64 


0.14 


13 


R1000/pAW31 


5 


9.94 


14 


R1000/pAW31 


9 


6.72 


15 


R1000/pAW31 


8 


7.08 


16 


R1000/pAW31 


8 


6.31 


17 


R1000/pAW31 


23 


1.86 


18 


R1000/pAW31 


8 


7.33 


19 


R1000/pAW31 


10 


5.99 


20 


R1000/pAW31 


7 


8.83 


21 


R1000/pAW32 


9 


6.80 
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Root Sanrole 


Transformation 
Vector o«t*»ri 


41fl^3 


ilfl* TA18>3 


22 


R1000/pAW32 


4 


11 .8 


23 


R1000/pAW32 


3 


18,8 


24 


R1000/pAW32 


10 


6.21 


25 


R1000/pAW32 


7 


8.57 


26 


RlO0O/pAW32 


3 


16.4 


27 


Rl00O/pAW32 


6 


8.29 


28 


Rl000/nAH32 


5 




29 


R1000/pAW32 


5 


8. 47 


30 


R1000/pAW32 


8 


7.17 


31 


R1000/pAW32 


4 


11.9 


32 


R1000/pAW32 


8 


7.20 


33 


R1000/dAW32 


5 


10-4 


34 




3 






ftl 000 /r»AW^9 




17 9 
if 


JO 


Ol rtflrt /«Atff39 
Ki.UUU/pnnj£ 


o 
0 


7 9*7 




PI flf) A /nltJ^9 
tSJL U vU / pAVV j£ 


q 


C fll 
D.UX 


JO 


KlUvU/pAWJZ 


Q 




an 

*t u 


KX U U U / panJZ 


Q 


O • 


4 1 
ti 




O 

o 


7 93 


45 
ha 


R1000 
iuvvv 


a 

o 


7 A3 


43 


RIO 00 


10 


6.20 


44 


R1000 


9 


5.97 


45 


R1000 


9 


6.73 


46 


R1000 


9 


6.27 


47 


R1000 


8 


7.27 


48 


R1000 


7 


8.30 


49 


R1000 


9 


7.11 



The ability of R10Q0 transformed "hairy" roots to 
grow in the absence of exogenous phytohormones can' be 
attributed to the Ri plasmid, pRiA4b. When R1000/pAW31 
5 or R1000/pAW32 strains are used to transform, only a 
fraction (about half) of the "hairy" roots will also be 
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transformed with the experimental binary vector, pAW31 
or pAW32. Thus, as expected, not all hairy roots 
resulting from transformation with R1000/pAW31 show the 
high 18:3 phenotype. The absense of any significant 
5 fatty acid phenotype in -hairy roots" transformed with 
R1000/PAW31 is expected, since carrot and Ar^hirinp .^ 
delta-15 desaturase sequences are not expected to be 
sufficiently related. These results show that 
overexpression of Arflh i flops H microsomal delta-15 
desaturase can result in over 10-fold increase in 18:3 
at the expense of 18:2 in heterologous plant tissue. 
Overexpression of Arabjdopfn s Delta-15 Fatty Acid 
Desaturase in Seeds and Complementation of the 

15 To complement the delta-15 desaturation mutation in 

the T-DNA mutant 3707 and to test the biological effect 
of overexpression of SEQ ID NOM ( Arabiriop ^ <: microsomal 
delta-15 fatty acid desaturase) in seed, the embryo- 
specific promoter: SEQ id NO:i chimeric gene was 
20 transformed into the mutant plant. This embryo-specific 
expression cassette in pAW42 was produced, in part, 
using a modified version of vector pCW109. Vector 
PCW109 itself was made by inserting into the Hind III 
site of the cloning vector pUC18 (Bethesda Research 
25 Laboratory) a 555 bp 5' non-coding region (containing 
the promoter region) of the p-conclycinin gene followed 
by the multiple cloning sequence containing the 
restriction endonuclease sites for Nco I, Sma I, Kpn I 
and Xba I, then 1174 bp of the common bean phaseolin 3' 
untranslated region into the Hind ill site [Slightom e t 
al., Proc. Nat' 1 Acad.- Sci. U.S.A. (1983) 80:1897-1901]. 
The P-conclycinin promoter region used is an allele of 
the published JJ-conglycinin gene (Doyle et al., J. Biol. 
Ghem. (1986) 261:9228-9238) due to differences at 27 
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nucleotide positions. Further sequence description may 
be found in Slightom (W091/13993) . 

The modifications to vector pCW109 were as follows: 
The potential translation start site was destroyed by 
5 digestion with Nco I and Xba I restriction enzymes 
followed by treatment with mung bean nuclease (New 
England Biolabs) to create linear, blunt ended DNA 
molecules. After ligation of Not I linkers (New England 
Biolabs) and digestion with Not I restriction enzyme 

10 (New England Biolabs) the plasmid was recircularized. 
Confirmation of the desired change was obtained by 
dideoxy sequencing. The resulting plasmid was 
designated P AW35. The 1.8 kB Hind III fragment from 
P AW35 containing the modified p-conclycinin promotor/3' 

15 phaseolin region was subcloned into the Hind III site in 
plasmid vector pBluescript SK+ (Stratagene) creating 
plasmid P AW36. Plasmid P AW36 was linerized at its 
unique Eco RI site and ligated to Eco Rl/Xho I adaptors 
(Stratagene). Following digestion with Xho I, the 1.7 

20 kB Xho I fragment containing the p-conclycinin 

promotor/Not I site/3 '-phaseolin untranslated region was 
cloned into the Xho I site in pURA3 vector (Clonetech) . 
The resultant plasmid, pAW42, contains the seed specific 
expression cassette bordered by Xho I sites to 

25 facilitate cloning into modified T-DNA binary vectors 
and a unique Not I site to facilitate cloning of target 
cDNA sequences. Vector pURA3 was choosen due to the 
absence of a Not I restriction site, the presence of a 
single Xho I restriction site, and the relatively large 

30 size of the vector ( P URA3) would make the isolation of 
the gene expression cassettes relatively easy from the 

final construct. 

The 1.4 kB Not I fragment in plasmid pCF3 
containing Arabidopsis microsomal delta-15 desaturase 
35 (SEQ ID NO:l) was isolated and ligated to plasmid pAW4 2 
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(the seed-specific expression cassette) that had 
previously been linearized with Not I restriction enzyme 
and treated with calf intestinal alkaline phosphatase 
(Boehringer Mannheim) to result in plasmids pAW45 that 

5 had SEQ ID NO:l cloned in a sense orientation with 
respect to the promoter. The orientation of the cDNA 
relative to the promotors was established by digestion 
with appropriate restriction endonucleases or by 
sequencing across the promotor-cDNA junctions . 
TO The chimeric 0-conclycinin promot or /sense SEQ IP 

NO : 1/phaseolin 3* was isolated as a 3.2 kB Xho I 
fragment from plasmid pAW4 5 and subcloned into the 
binary vector pAW25 at its unique Sal I site. In the 
resulting vector, pAWSO, the orientation of the plant 

15 selectable marker is. the same as that of the 

p-conclycinin promoter as acertained by digestion with 
appropriate restriction endonucleases . Plasmid pAW25, 
is derived from plasmids pZS94K and P ML2 . Plasmid 
pZS94K contains the pBR322 origin of replication, the 

20 replication and stability regions of the PfifM i riomgnas 
^mainosa plasmid pVSl [Itoh, et al., (1984) Plasmid 
11:206-^220] required for replication and maintenance of 
the plasmid in ^w^^rlum. the bacterial NPT II gene 
(kanamycin resistance) from Tn5 IB'erg et al., (1975) 

25 Prbc. Nat'l. Acad. Sci. U.S.A. 72:3628-3632] as a 

selectable marker for transformed bacteria, a T-DNA left 
border fragment of the octopine Ti plasmid pTiA6 and 
right border fragment derived from TiAchS describe by 
van den Elzen et al. (Plant Mol. Biol. (1985) 

30 5:149-154). Between these borders are the £. coli. lacZ 
a-complementing segment- [Vieria and Messing (1982) Gene 
19:259-267] with restriction endonuclease sites Sal I 
and Asp 718 derived from P UC18. A 4.5 kB Asp 718-Sal I 
DNA fragment containing the chimeric herbicide 

35 sulfonylurea (SU) -resistant acetolactate (ALS) gene was 
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obtained from plasmid pML2 and cloned into the Asp 
718r-Sal I sites of plasmid pZS94K. This chimeric ALS 
gene contained the CaMV 35S prpmoter/Cab22L Bgl II-Nco I 
fragment that is described by Harpster et al., IMol. 
5 Gen. Genet. (1988) 212:182-190) and the Arahidops i s ALS 
coding and 3* non-coding sequences [Mazur et al., (1987) 
Plant Physiol. 85:1110-1117] that was mutated so that it 
encodes a SU- resistant form of ALS. The mutation, 
introduced by site-directed mutagenesis/ are those 

10 present in the tobacco SO-resistant Hra gene described 
by Lee et al., (1988) EMBO J. 5:1241-1248. The 
resulting plasmid was designated pAW25 . 

The binary vector pAW25 containing the chimeric 
embryo-specific P^conglycinin promoter: sense SEQ ID N0:1 

15 gene was transformed by the freeze/thaw method (Holsters 
et al., (1978) Mol. Gen. Genet. 163:181-187] into the 
avirulent Agrobacterium strain LBA4404/pAL4404 [Hoekema 
et al., (1983) Nature 303:179-180]. 

fl ^Hidnpsis root cultures were transformed by co- 

20 cultivation with MT-ohaci-ftrium using standard aseptic 
techniques for the manipulation of sterile media and 
axenic plant/bacterial cultures were followed, including 
the use of a laminar flow hood for all transfers. 
Compositions of the culture media are listed in Table 8. 

25 Unless otherwise indicated, 25x100 mm petri plates were 
used for plant tissue cultures. Incubation of plant 
tissue cultures was at 23°C under constant illumination 
with mixed fluorescent and "Gro and Sho" plant lights 
(General Electric) unless otherwise noted. To initiate 

30 ia vitro root cultures of the T-DNA homogyzous mutant 
line 3707 / &™Mrfopsis rha liana (L.) Heynh, geographic 
race Wassilewshi ja) seeds of the mutant line were 
sterilized for 10 min in a solution of 50% Chlorox with 
0.1% SDS, rinsed 3 to 5 times with sterile dH 2 0, dried 

35 thoroughly on sterile filter paper, and then 2-3 seeds 
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were sown in liquid B5 medium in 250 mL Belco flasks. 
The flasks were capped, placed on a rotary shaker at 
70^-80 rpm, and incubated for 3-4 weeks. Prior to 
inoculation with Acrrobanterium, root tissues were 
5 cultured on callus induction medium (MSKig) . Roots were 
harvested by removing the root mass from the Belco 
flask, placing it in a petri dish, and, using forceps, 
pulling small bundles of roots from the root mass and 
placing them on MSKig medium. Petri dishes were sealed 

10 with filter tape and incubated for four days. 

^rohartftrium strain LBA4404 carrying the plasmids 
pAL4404 and pAW50 were grown in 5 mL of YEB broth 
containing 25 mg/L kanamycin and 100 mg/L rifampicin. 
The culture was grown for approximately 17-20 h in glass 

15 culture tubes in a New Brunswick platform shaker (225 
rpm) maintained at 28°C. Pre-cultured roots were cut 
into 0.5 cm segments and placed in a 100 \im filter, made 
from a Tri-Pour beaker (VWR Scientific, San Francisco, 
CA USA) and wire mesh, which is set in a petri dish. 

20 Root segments were inoculated for several min in 30-50 
mL of a 1:20 dilution of the overnight ftgroftacterium 
culture with periodic gentle mixing. Inoculated roots 
were transferred to sterile filter paper to draw off 
most of the liquid. Small bundles of roots, consisting 

25 of several root segments, were placed on MSKig medium 
containing 100 JIM acetosyringone (3 • , 5 •-Dimethoxy-4 
hydroxyaceto-phenone, Aldrich Chemical Co., Milwaukee, 
WI, USA) . Petri plates were sealed with paraf ilm or 
filter tape and incubated for 2 to 3 days. 

30 After infection, root segments were rinsed and 

transferred to shoot induction medium with antibiotics. 
Root bundles were placed in a 100 Jim filter unit 
(described above) and rinsed with 30-50 mL liquid MSKig 
medium. The filter was vigorously shaken in the 

35 solution to help remove the ftgroftflKUrium, transferred 
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to a clean petri dish, and rinsed again. Roots were 
blotted on sterile filter paper and bundles of roots 
were placed on MSg medium containing 500 mg/1 vancomycin 
arid either 10 or 20 ppb chlorsulf uron . Plates were 
5 sealed with filter tape and incubated for 12 to 14 days. 
Green nodules and small shoot primordia were 
visible at about 2-3 weeks. The explants were either 
left intact or were broken into numerous pieces and 
placed on GM medium containing 200-300 mg/L vancomycin 

10 and either 10 or 20 ppb chlorsulfuron for further shoot 
development. Plates were either sealed with two pieces 
of tape or with filter tape . As they developed, 
individual shoots were isolated from the callus and were 
placed on MSRg medium containing 100 mg/L vancomycin and 

15 either 10 or 20 ppb chlorsulfuron. Dishes were sealed 
as described above and incubated for seven to 10 days. 
Shoots were then transferred to GM medium containing 
100-200 mg/L vancomycin in 25x100 petri dishes or 
Magenta G7 vessels. Many primary trans formants (Tl) 

20 which were transferred to individual containers set seed 
(T2) . 

T2 seed was harvested from selected putative 
transformants and sown on GM medium containing lOppb 
chlorsulfuron. Plates were sealed with filter tape, 

25 cold treated for 2 or more days at 4°C, and then 
incubated for 10 to 20 days at 23°C under constant 
illumination as described above. Seedlings were scored 
as resistant (green, true leaves develop) and sensitive 
(no true leaves develop) . 

30 Selected chlorsulfuron resistant T2 seedlings were 

transplanted to soil and were grown to maturity at 23°C 
daytime (16 h) 18'C nighttime (8 h) at 65-80% relative 
humidity. 

T2 seeds from two plants were harvested at maturity 
35 and analysed individually for fatty acid composition by 
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gas chromatography of the fatty acyl methyl esters 
essentially as described by Browse et al . , (Anal . 
Biochera. (1986) 152:141-145) except that 2.5% H2SO4 in 
methanol was used as the methylation reagent and samples 
5 were heated for 1.5 h at 80°C to effect the methanolysis 
of the seed triglycerides. The results are shown in 
Table 7. 



TABLE 7 

Percent Fatty Acid in Seeds of 
Transgenic Mutant 3707 



Seed Sample 


16:0 


1B;0 


lfi;l 


^ lfl:2 


IB: 3 


wildtype(6) 


6 


4 


14 


30 


19 


mutant 3707(6) 


6 


4 


14 


44 


3 


1-1 


10 


4 


22 


9 


55 


1-2 


11 


6 


22 


14 


48 


1-3 


12 


7 


16 


6 


57 


1-4 


10 


4 


30 


52 


4 


1-5 


10 


4 


18 


17 


48 


1-6 


10 


5 


15 


15 


53 


2-1 


11 


5 


19 


60 


4 


2-2 


10 


5 


19 


9 


56 


2-3 


9 


4 


27 


8 


52 


2-4 


10 


5 


17 


10 


56 


2-5 


10 


5 


19 


9 


56 


2-6 


10 


5 


17 


17 


48 


The fatty 


acid composition of the 


wild-type 


and 



mutant line 3707 represents the average of 6 single 
iO seeds each. Seeds from plant 1 are designated 1-1 to 
1-6 and those from plant 2 are designated 2-1 to 2-6. 
The 20:1 and 20:2 amounts are not shown. The data shows 
that, the one out of six seeds in each plant show the 
mutant fatty acid phenotype, while the remaining seeds 
15 show more than 10-fold increase in 18:3 to ca.55%. 
While most of the increase occurs at the expense of 
18:2, some of it also occurs at the expense of 18:1. 
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Such high levels are of linolenic acid in vegetable oils 
are observed in specialty oil crops, such as linseed. 
Thus f overexpression of this gene in other oilscrops, 
especially canola, which is a close relative of 
^hirion.sis, is also expected to result in such high 

levels of 18:3. 

Medium Composition 



BASIC MEDIUM 



5.0 g 
1.0 g 
5.0 g 
5.0 g 
0.5 g 
15.0 g 



YEP MEDIUM 
Bacto Beef Extract 
Bacto Yeast Extract 
Peptone 
Sucrose 
MgS04-7H20 
Agar (optional) 
PH 

VITAMIN SUPPLEMENT 
10 mg/L thiamine 
50 mg/L pyridoxine 
50 mg/L nicotinic acid 

« « MSa - Shoot Induction Medium 
MSKIg * Callus Induction Medium 

Basic Medium 



1 Pkg. Murashige and Skoog 
Minimal Organics Medium without 
Sucrose {Gibco #510-3118 or 
Sigma #M6899) 

10 ml* Vitamin Supplement 

0.05% MES 0.5 g/L 

0.8% agar 8 g/L 

pH 

GM «= Germination Medium 
Basic Medium 
1% sucrose 10 g/L 



2% glucose 20 g/L 
0.5 mg/L 2,4-D 2.3 \IL 

0-3 mg/L Kinetin 1.4 pH 

5 mg/L IAA j&.5 JIM 

MSRg » Shoot Induction Medium 

Basic Medium 

2% glucose 20 g/L 

12 mg/L IBA 58.6 uM 

0.1 mg/L Kinetin 0.46*HM 



Basic Medium 
2% glucose 
0.15 mg/L IAA 
5.0 mg/L 2iP 



20 g/L 

0.86 UM 
24.6 JIM 
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EXAMPLE 10 

Construction of Vectors for Transformation 

of Rr aqsiGa napus for Reduced Expression 
ftf pjOfa-lS De sfli-nrases in Developing Seeds 
5 Detailed procedures for manipulation of DNA 

fragments by restriction endonuclease digestion, size 
separation by agarose gel electrophoresis, isolation of 
DNA fragments from agarose gels, ligation of DNA 
fragments, modification of cut ends of DNA fragments and 
10 transformation of £♦ £0H cells with circular DNA 
plasmids are all described in Sarobrook et al., 
(Molecular Cloning, A Laboratory Manual, 2nd ed (1989) 
Cold Spring Harbor Laboratory Press) and Ausubel et al. , 
Current Protocols in Molecular Biology (1989) John Wiley 

15 & Sons) - 

Sequences of the cDNA's encoding the £. napiLS 

cytoplasmic delta-15 desaturase and the Brassicfl napus 
plastid delta-15 desaturase were placed in the antisense 
orientation behind the promoter region from the a' 
20 subunit of the soybean storage protein p-conglycinin to 
provide embryo specific expression and high expression 
levels . 

An embryo-specific expression cassette was 
constructed to serve as the basis for chimeric gene 

25 constructs for anti-sense expression of the nucleotide 
sequences of delta-15 desaturase cDNAs. The vector 
pCWl09 was produced by the insertion of 555 base pairs 
of the {J-conglycinin (a 1 subunit of the 7s seed storage 
protein) promoter from soybean f Glycine maa) , the 

30 P-conglycinin 5' untranslated region followed by a 
multiple cloning sequence containing the restriction 
endonuclease sites for Nco I, Sma I, Kpn I and Xba I, 
then 1174 base pairs of the common bean phaseolin 3' 
untranslated region into the Hind III site in the 

35 cloning vector pUC18 (BRL) . The p-conglycinin promoter 
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sequence represents an allele of the published 
P-conglyeinin gene (Doyle et al., (1986) J. Biol. Chem. 
261:9228-9238) due to differences at 27 nucleotide 
positions. Further sequence description may be found in 
5 Slightom (W091/13993) . The sequence of the 3' 
untranslated region of phaseolin is described in 
(Slightom et al.,(1983) Proc. Natl. Acad. Sci. USA, 

80:1897-1901) . 

To facilitate use in antisense constructions, the 

10 Nco I site and potential translation start site in the 
plasmid pCW109 was destroyed by digestion with Nco I, 
mung bean exonuclease digestion and re-ligation of the 
blunt site to give the modified plasmid pCW109A. 
P CW109A was opened between the p*-cbnglycinin promoter 

15 sequence and the phaseolin 3 ' sequence by digestion with 
Sma I to allow insertion of blunt ended cDNA fragments 
encoding the delta-15 desaturase sequences by ligation. 
The blunt ended fragment of the cytoplasmic delta-15 
desaturase was obtained from plasmid pBNSF3, which 

20 contains the nucleotides 208 to 1336 of the cDNA insert 
described in SEQ ID NO: 6. pBNSF3 was modified to remove 
the Hind III site at bases 682 to 687 of SEQ ID 6 by 
digesting with Hind III, blunting with Klenow and re- 
iigating. The resulting plasmid [pBNSF3 (-H) ] , was 

25 digested with Eco RI and Xho I to release the delta-15 
cDNA fragment, all ends were Klenow blunted and the 1.2 
kB coding region was purified by gel isolation. The 1.2 
kB fragment was ligated into the Sma I cut pGW109A 
described above. The antisense orientation of the 

30 inserted cDNA relative to the pVconglycinin promoter was 
established by digestion with Aat I which cuts in the 
delta-15 desaturase coding region and in the vector 5* 
to the 0-conglycinin promoter to release a 1.4 Kb 
fragment when the coding region is in the antisense 
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orientation. The antisense construction was given the 
name pCCFdRl . 

The transcription unit [P-conglycinin 

promoter: antisense delta-15 desaturase :phaseolin 3' end] 
5 was released from pCCFdRl by Hind III digestion, 

isolated, and ligated into pBluescript which had also 
been Hind III digested to give plasmid pCCFdR2. This 
construct has unique BamH I and Sal I sites which were 
digested. The 3 JcB transcriptional unit was isolated 

10 and cloned into the Bam HI and Sal I sites in pZ199 

described below to give the binary vector pZCC3FdK, The 
orientation given by this directional cloning is with 
transcription of both the selectable marker gene and the 
delta- 15 antisense gene in the same direction and toward 

15 the right border tDNA sequence. 

An antisense construction based on the plastid 
delta-15 desaturase was made with the 425 most 3' bases 
of SEQ ID NO: 8 which is contained in the plasmid pBNSFD- 
8. pBNSFD-8 represents a cDNA of the plastid delta-15 

20 desaturase in pBluescript. The cDNA insert was removed 
from pBNSFD-8 by digestion with Xho I and Sma I, the 
fragments were blunted, and the 425 base insert isolated 
by gel purification. The isolated fragment was cloned 
into the Sma I site of pCW109A and the antisense 

25 orientation of the chosen clone confirmed by digestion 
of the plasmid with Pst I . Pst I cuts in the plastid 
delta-15 sequence and in the pCW109A vector 5 1 to the 
P-conglycinin promoter to release a 1.2 kB fragment 

indicative of the antisense orientation. The plasmid 
30 containing this construction was called pCCdFdRl. 

Digestion of pCCdFdRl with Hind III removes a 2.3 
*• kB fragment containing the transcriptional unit 

[|}-conglycinin promter :plastid delta-15 antisense:3 

i phaseolin sequence]. The fragment was gel isolated and 

35 cloned into Hind III digested pBluescript. The 
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orientation of the fragment was relative to the Bam HI 
site in the cloning region of pBluescript was determined 
by digestion with Pst I as described above, A clone 
oriented with the promoter toward the Sal I containing 
5 end was chosen and given the name pCCdFdR2. 

pGGdFdR2 was digested with Bam HI and Sal I, the 
released fragment was gel isolated and ligated into 
pZ199 which had been digested with Bam HI and Sal I to 
give the binary vector pZCCdFdR. 
10 Vectors for transformation of the antisense 

delta-15 desaturase constructions under control of the 
{J-conglycinin promoter into plants using Agrohar?tf>rium 
MiTn^faeiens were produced by constructing a binary Ti 
plasmid vector system (Bevan, (1984) Nucl. Acids Res. 
15 12:8711-8720). The starting vector used for these 

systems (pZS199) is based on a vector which contains: 
(1) the chimeric gene nopaline synthase/neomycin 
phosphotransferase as a selectable marker for 
transformed plant cells (Bevan et al. # (1984) Nature 
20 304:184-186), (2) the left and right borders of the 
T-DNA of the Ti plasmid (Bevan et al., (1984) Nucl. 
Acids Res. 12:8711-8720), (3) the £. soli lacZ 
a-complementing segment (Vieria and Messing (1982) Gene 
19:259-267) with unique restriction endonuclease sites 
25 for Eco RI, Kpn I, Bam HI, Hin Dili, and Sal I, (4) the 
bacterial replication origin from the Pfieudomonas 
plasmid pVSl (Itoh et al., (1984) Plasmid 11:206-220), 
and (5) the bacterial neomycin phosphotransferase gene 
from Tn5 (Berg et al., (1975) Proc. Natnl. Acad. Sci. 
30 U;S.A. 72:3628-3632) as a selectable marker for 

transformed A. t-.nmefaelens. The nopaline synthase 
promoter in the plant selectable marker was replaced by 
the 35S promoter (Odell et al. (1985) Nature, 
313:810-813) by a standard restriction endonuclease 
35 digestion and ligation strategy. The 35S promoter is 
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required for efficient Brass ica napus transformation as 
described below. 

EXAMPLE 11 
AnRORACTERItJM MEDIATED TRANSFORMATION 
5 OF BRASS ICA NAPUS 

The binary vectors pZCC3FdR abd pZCCdFdR were 
transferred by a freeze/thaw method (Holsters et al., 
(1978) Mol Gen Genet 163:181-187) to the Aorbbacterium 
strain LBA4404/pAL4404 (Hoekema et al., (1983), Nature 
10 303:179-180) . 

Brass ica napus cultivar "Westar" was transformed by 
co-cultivation of seedling pieces with disarmed 
Aarobacteriun) tumefaciens strain LBA4404 carrying the 
the appropriate binary vector. 
15 a. napus seeds were sterilized by stirring in 10% 

Chlorox, 0.1% SDS for thirty min, and then rinsed 
thoroughly with sterile distilled water. The seeds were - 
germinated on sterile medium containing 30 mM CaCl2 and 
1.5% agar, and grown for six days in the dark at 24°C. 
20 Liquid cultures of Aorobacterium for plant 

transformation were grown overnight at 28°C in Minimal A 
medium containing 100 mg/L kanamycin. The bacterial 
cells are pelleted by centrifugation and resuspended at 
a concentration of 10 8 cells/mL in liquid Murashige and 
25 Skoog Minimal Organic medium containing 100 jlM 
acetosyringone . 

fi. napus seedling hypocotyls were cut into 5 mm 
segments which were immediately placed into the 
bacterial suspension. After 30 min, the hypocotyl 
30 pieces were removed from the bacterial suspension and 
placed onto BC-12 callus medium containing 100 JIM 
acetosyringone. The plant tissue and Aarobacteria were 
co-cultivated for three days at 24°C in dim light. 

The co-cultivation was terminated by transferring 
35 the hypocotyl pieces to BC-12 callus medium containing 
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200 ing/L carbenicillin to kill the AgrohflCtftriar and 
25 mg/L kanamycin to select for transformed plant cell 
growth. The seedling pieces were incubated on this 
medium for three weeks at 24°C under continuous light. 
5 After three weeks, the segments wre transferred to 

BS-48 regeneration medium containing 200 mg/L 
carbenicillin and 25 mg/L kanamycin. Plant tissue was 
subcultured every two weeks onto fresh selective 
regeneration medium, under the same culture conditions 
ID described for the callus medium. Putatively transformed 
calli grow rapidly on regeneration medium; as calli 
reached a diameter of about 2 mm, they were removed from 
the hypocotyl pieces and placed on the same medium 
lacking kanamycin. 
15 Shoots began to appear within several weeks after 

transfer to BS-48 regeneration medium. As soon as the 
shoots formed discernable stems, they were excised from 
the calli, transferred to MSV-1A elongation medium, and 
moved to a 16:8 h day/night photoperiod at 24°C. 
20 Once shoots had elongated several internodes, they 

were cut above the agar surface and the cut ends were 
dipped in Rootone. Treated shoots were planted directly 
into wet Metro-Mix 350 soiless potting medium. The pots 
were covered with plastic bags which were removed when 
25 the plants were clearly growing — after about 10 days . 

Plants were grown under a 16:8 h day/night photo- 
period, with a daytime temperature of 23°C and a 
nighttime temperature of 17°C. When the primary 
flowering stem began to elongate, it was covered with a 
30 mesh pollen-containment bag to prevent outcrossing. 

Self-pollination was facilitated by shaking the plants 
several times each day. Seeds derived from self- 
pollinations were harvested about three months after 
planting . 
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TABLE 9 



Minimal A Bacterial Growth Medium 

Dissolve in distilled water: 

10.5 g potassium phosphate, 
dibasic 

* 4.5 g potassium phosphate, 
monobasic 

1.0 g ammonium sulfate 

0.5 g sodium citrate, dihydrate 

Make up to 979 mLs With 
distilled water 

Autoclave 

Add 20 mLs filter-sterilized 
10% sucrose 

Add 1 mL filter-sterilized 
1 M MgS<>4 



Brassica Callus Medium BC-12 
Per liter: 

Murashige and SJcoog Minimal 
Organic Medium (MS salts, 100 
mg/L i -inositol, 0.4 mg/L 
thiamine; GIBCO #510-3118) 
30 sucrose 
18 g mannitol 
1.0 mg/L 2,4-p 
3.0 mg/L Jcinetin 
0.6% agarose 
pH 5.8 



Brassica Shoot Elongation 
Medium MSV-1A 

Murashige and Skoog Minimal 
Organic Medium Gaxnborg B5 
Vitamins 

10 g sucrose 

0.6% agarose 
pH 5.8 



Brassica Regeneration Medium BS-48 
Murashige and Skoog Minimal 
Organic Medium Gamborg B5 Vitamins 
(SIGMA #1019) 
10 g glucose 
250 mg xylose 
600 mg MES 

0.4% agarose 
pH 5.7 

Filter-sterilize and add after 
autoclaving: 

2.0 mg/L zeatin 

0.1 mg/L IAA 

EXAMPLE 12 

ANALYSTS OF TRANSGEN IC BRASSICA NAPUS PLANTS 

Insertion of the intact antisense transcriptional 
unit was; verified by Southern analysis using transgenic 
plant leaf tissue as the source of DNA as described in 
Example 5. Ten micrograms of leaf DNA was digested to 
completion with a mixture of Bam HI and Sal I 
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restriction endonucleases and then separatd by agarose 
gel electrophoresis. The separated DNA was transferred 
to Hybond H + membrane and hybridized with radiolabeled 
insert f rom pBNSF3-2 . Ah estimate of the number of 
5 copies of the inserted transgene was made by calibrating 
each Southern blot with standard amounts of pBNSF3-2 
corresponding to 1 and 5 copies per genome and comparing 
intensities of the autoradiographic signal from the 
standards, the endogenous delta-15 desaturase signals 
10 and the inserted gene signal* To date, 38 independent 
transformants have been analyzed for presence of the 
gene and 36 were found to be positive. 

The relative content of the 5 most abundant fatty 
acids in canola seeds was determined either by direct 
15 trans-esterification of individual seeds in 0.5 mL of 
itiethanolic H2SO4 (2.5%) or by hexane extraction of bulk 
seed samples followed by trans-esterif ication of an 
aliquot in 0.8 mL of 1% sodium methoxide in methanol. 
Fatty acid methyl esters were extracted from the 
20 methanolic solutions into hexane after the addition of 
an equal volume of water. 

The relative content of" 18:3 fatty acid varies 
significantly during seed development. To a lesser 
extent, the ratio of 18:3 to 18:2 varies also. Thus 
25 meaningful data can be obtained only from seeds after 

maturation and drydown. Additionally, the ratio of 18:3 
to total fatty acid content and to 18:0 varies 
significantly due to environmental factors, primarily 
temperature* In this circumstance, the most appropriate 
30 controls are the transformed plants which by Southern 
analysis do not contain* the antisense delta-15 
transgene. Analysis from the first 5 transformants to 
reach dry seed are given in Table 10 below. Seeds were 
harvested using a hand thesher, bulked and a 1.5 g 
35 (about 300 seeds) sample was taken. Seed from e^ch 
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transformant was crushed with a mortar and pestel, 
extracted 4 times with 8 mL hexane at about 50°C. The 
combined extracts were reduced in volume to 5 mL and two 
50 microliter aliquofcs were taken for esterif ication as 
5 described above. Separation of the fatty acid methyl 
esters was done by gas-liquid chromatography using ah 
Omegawax 320 column (Supelcp Inc., 0.32 mm ID X 30M) run 
isothermally at 220° and cycled to 260° between each 
injection. 

TABLE 1 0 

Antiaenae 



Tranaformant flpilta-IS 

Usl. \ 18:3 tlB;3/IB:2 Capy No. 

pZCC3FdR-91 6.2 0.39 0 

pZCC3FdR-81 5.9 0.33 1 

pZCC3FdR-15 6.0 0.38 2 

pZCC3FdR-l 1 5.6 0.34 1 

pZCC3FdR-14B 8.2 0,40 2 



10 The differences between the 4 transformed lines and 

line 92 are very small, however to test the significance 
of the difference in the -18:3/18:2 ratio between line 81 
arid 91, 25 individual seeds from each line were trans- 
esterified and their fatty acid composition determined. 
15 The average ratio for line 81 was 0.345 with a 

coefficient of viariation of 11.6% while the average for 
line 91 was 0.375 with a coefficient of variation of 
8.0%. The sample means are significantly different at 
the 0.01% level using Student's t test. 
20 EXAMPLE 13 

CONSTRUCTION OF VECTORS FOR TRANSFORMATION OF 
GLYCINE MAX FOR REDUCED EXPRESSION OF DELTA- 15 
DESATURASES IN DEVELOPING SEEDS 

The antisense £. max plastid delta-15 desaturase 
25 cDNA sequence under control of the {J-conglycinin 
promoter was constructed using the vector pCW109A 
described in Example 10 above. For use in the soybean 
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transformation system described below, the trans- 
criptional unit was placed in a vector along with an 
appropriate selectable marker expression system. The 
starting vector was pML45, which consists of the non^- 
5 tissue specific and constitutive promoter designated 
508D and described in Hershey (WO 9011361) driving 
expression of the neomycin phosphotransferase gene 
described in (Beck et al. (1982) Gene 19:327-336) 
followed by the 3' end of the nopaline synthase gene 

10 including nucleotides 848 to 1550 described by (Depicker 
et al. (1982) J. Appl. Genet. 1:561-574) . This 
transcriptional unit was inserted into the commercial 
cloning vector pGEM9Z (BRL) and is flanked at the 5 9 end 
of the 508D promoter by the restriction sites Sal I, Xba 

15 I, Bam HI and Sma I in that order. An additional Sal I 
site is present at the 3' end of the NOS 3' sequence and 
the Xba I, Bam HI and Sal I sites are unique. 

Removal of the unit [{J-conglycinin promter: cloning 

region :phaseolin 3' end] from pCW109A by digestion with 
20 Hind III, blunting the ends and isolating the 1.8 kB 
fragment afforded the expression cassette pCST by 
ligating the above isolated fragment into the Sma I site 
of pML45. A clone with the p-conglycinin promoter in 
the same orientation as the 508D promoter were chosen by 
25 digestion with Xba I. The correct orientation releases 
a 700 bp fragment. This vector cassette was called 
pCST* 

The 2.2 kB insert encoding the soybean, plastid 
delta-15 desaturase was siibcloned from the plasmid pXFl 

30 by digestion with HinP I to remove about 1 kB of 

unrelated cDNA. HinP I-cuts within the cDNA insert very 
near the 5' end of the cDNA for the delta-15 desaturase 
and about 300 bp from the 3' end of that cDNA. The Cla 
I compatable ends were cloned into Cla I digested 

35 pBluescript and a clone with the 5' end of the cDNA 
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toward the Eco RV site in the pBluescript cloning region 
was selected based on the relaese of a 900 bp fragment 
by digestion with Pst I. The subcloned plasmid was 
called pS3Fdl. 

5 The delta-15 encoding sequence was removed from 

pS3Fdl by digestion with HinC II and Eco RV, the 2.2 kB 
fragment was gel isolated and cloned into the opened Sma 
I site in pCSTl. A clone with the delta-15 sequence in 
the antisense orientation to the p-conglycinin promoter 

10 was selected by digestion with Xba I. The antisense 
construct releases a 400 bp piece and that clone was 
designated pCS3FdSTlR. 

EXAMPLE 14 

TRANSFORMATION OF SOMATIC SOYBEAN EMBRYO CULTURES 

15 Soybean embryogenic suspension cultures are 

maintained in 35 mL liquid media (SB55 or SBP6) on a 
rotary shaker, 150 rpm, at 28°C with mixed f lorescent 
and incandescent lights on a 16:8 h day /night schedule. 
Cultures were subcultured every four weeks by 

20 inoculating approximately 35 mg of tissue into 35 mL of 
liquid medium i 

Soybean embryogenic suspension cultures were 
transformed with pCS3FdST!R by the method of particle 
gun bombardment (see Kline et al. (1987) Nature (London) 

25 327:70). A Du Pont Biolistic PDS1000/HE instrument 
(helium retrofit) was used for these transformations. 

To 50 mL of a 60 mg/mL 1 mm gold particle 
suspension was added (in order); 5 \1L DNA ( 1 \lg/\lL) , 20 
|IL spermidine (0.1M), and 50 \ll CaCl2 (2.5 M) . The 

30 particle preparation was agitated for 3 min, spun in a 
microfuge for 10 sec and -the supernatant removed. The 
DNA-coated particles were then washed once in 400 p,L 70% 
ethanol and resuspended in 40 jtL of anhydrous ethanol. 
The DNA/particle suspension was sonicated three times 
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for 1 sec each. Five of the DNA-coated gold 
particles were then loaded on each macro carrier disk. 

Approximately 300-400 rag of a four week old 
suspension culture was placed in an empty 60x15 mm petri 
5 dish and the residual liquid removed from the tissue 
with a pipette. For each transformation experiment, 
approximately 5-10 plates of tissue were normally 
bombarded. Membrane rupture pressure was set at 1000 
psi and the chamber was evacuated to a vacuum of 28 
10 inches of mercury. The tissue was placed approximately 
3.5 inches away from the retaining screen and bombarded 
three times. Following bombardment, the tissue was 
placed back into liquid and cultured as described above. 
Eleven days post bombardment, the liquid media was 
15 exchanged with fresh SB55 containing 50 mg/raL 

hygromycin . The selective media was refreshed weekly. 
Seven weeks post bombardment, green, transformed tissue 
was observed growing from untransformed, necrotic 
embryogenic clusters. Isolated green tissue was removed 
20 and inoculated into individual flasks to generate new, 
clonally propagated, transformed embryogenic suspension 
cultures. Thus each new line was treated as independent 
transformation event. These suspensions can then be 
maintained as suspensions of embryos clustered in an 
25 immature developmental stage through subculture or 
regenerated into whole plants by maturation and 
germination of individual somatic embryos. 

Transformed embryogenic clusters were removed from 
liquid culture and placed on a solid agar media (SB103) 
30 containing no hormones or antibiotics. Embryos were 
cultured for eight weeks -at 26°C with mixed florescent 
and incandescent lights on a 16:8 h day /night schedule. 
During this period, individual embryos were removed from 
the clusters and analyzed at various stages of embryo 
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devielopment After eight weeks the embryos become 
suitable for germination. 

Media: B5 Vitimin Stpck 

SB55 and SBP6 Stock Solutions 10 g m-inositol 

100 mg nicotinic acid 
100 mg pyridoxine HC1 
1 g thiamine 
SB55 (per Liter) 

10 mL each MS stocks 
1 mL B5 Vitaimin stock 
0.8 g NH4KO3 
3.033 g KNO3 

1 mL 2,4-D {lOmg/mL stock) 
60 g sucrose 

0.667 9 asparagine 
pH 5.7 

For SBP6- substitute 0,5 mL 2,4-D 
SB103 <per Liter) 
MS Salts 

6% maltose 
750 mg MgGl 2 

0.2% Gelrite 
pH 5 .7 

EXAMPLE 15 

ANALYSTS OP TRANSGENIC GLYCINE MAX PLANTS 

5 While in the globular embryo state in liquid 

culture as described in Example 1.4, somatic soybean 
embryos contain very low amounts of triacylglycerol or 
storage proteins typical of maturing, zygotic soybean 
embryos* At this developmental stage, the ratio of 
10 total triacylglyceride to total polar lipid 

(phospholipids and glycolipid) is about 1:4, as is 
typical of zygotic soybean embryos at the developmental 



(g/L) : 

MS Sulfate 100X Stock 

MgS0 4 7H20 37.0 

MnS0 4 H2Q 1.69 

2riS0 4 7H20 0.86 

CUSO4 5R20 0.0025 

MS Haiides 100X Stock 

CaCl 2 2H 2 0 44.0 

KI 0.083 

CoCl 2 6H 2 0 0.00125 

KH2PO4 17.0 

H3BO3 0.62 

Na 2 Mo0 4 2H 2 0 0.025 

MS FeEDTA 100X Stock 

Na 2 £OTA 3.724 

FeS0 4 7H 2 0 2.784 
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stage from which the somatic embryo culture was 
initiated. At the globular stage as well, the mRNAs for 
the prominant seed proteins (a* subunit of 
p-conglycinin, Kunitz Trypsin Inhibitor III and Soybean 
5 Seed Lectin) are essentially absent . Upon transfer to 
hormone free media to allow differentiation to the 
maturing somatic embryo state as described in Example 
14, triacylgiycerol becomes the most abundant lipid 
class. As well, mRNAs for a '-subunit of p-conglycinin, 

10 Kunitz Trypsin Inhibitor III and Soybean Seed Lectin 
become very abundant messages in the total mRNA 
population. In these respects the somatic soybean 
embryo system behaves very similarly to maturing zygotic 
soybean embryos in vivo, and is therefore a good and 

15 rapid model system for analyzing the phenotypic effects 
of modifying the expression of genes in the fatty acid 
biosynthesis pathway. Similar somatic embryo culture 
systems have been documented and used in another oilseed 
crop, rapeseed (Taylor et al. (1990) Planta 181:18-26). 

20 Fatty acid analysis was performed as described in 

Example 12 using single embryos as the tissue source. A 
number of embryos from line 2872 (control tissue 
transformed with pCST) and lines 299,303,306 and 307 
(line 2872 transformed with plasmid P CS3FdSTlR) were 

25 analyzed for fatty acid content. The relative fatty- 
acid composition of embryos taken from tissue 
transformed with pCS3FdSTlR was compared with control 
tissue, transformed with pCST. The results of this 
analysis are shown in Table 12. 









TABLE 12 








Line 


fftnhryo 


16:0 


• IfitO 


18:1 


19:2 


?8:3 


2872 


1 


17.7 


4.1 


11.3 


52.8 


14.1 




2 


17.3 


4.3 


10.9 


49.5 


18.0 




3 


16.1 


4.1 


13.8 


48.2 


17.3 




4 


17.5 


3.6 


11.7 


52.0 


14.1 



WO 93/1 1245 



PCT/US92/ 10284 



123 





Embrvo 


16:0 


lfl:0 


18:1 


18:2 


1.8:3 




5 


16.6 


3.9 


.12.7 


53.7 


12.6 




6 


14.8 


3.0 


14.7 


55.3 


11.1 




av 


16.7 


3.8 


12.5 


51.9 


14.5 


299-1-3 


1 


16.5 


4.1 


9.7 


61.4 


6.3 


299-is-i 


1 


14.7 


3.6 


11.9 


61.3 


8.4 




2 


16.6 


3.7 


12.1 


58.6 


8.6 




3 


16.7 


4.1 


14.9 


53.2 


11 l 




4 


15.2 


4.0 


9.1 


60.2 


11 . 5 




5 


16.0 


4.2 


13.9 


55 . 2 


10 . 7 




6 


15.2 


3.5 


9.9 


63.4 


8.1 


303-7-1 


1 


14.1 


2.2 


10.6 


59.4 


13 . 7 




2 


14.0 


2.8 


12.5 


59. 3 


11 . 4 


306-4-5 




17. "5 


4.2 


8.1 


62. 7 


1 4 




215 7 

.1. J » f 


j « j 


9.0 


60 . 5 


11 5 






3 


17.1 


3.4 


9.3 


60.7 


Q 5 
j • j 


- 


4 


15 7 


3 ft 


j . &• 


61 2 






c 

J 


17 7 


j . 7 


£ 5 

O.J 


JO. J 


J. J . O 






16.6 


3.4 


10.2 


59.2 


10.6 


306-4-8 


1 


16. 6 


3. 9 


15.3 


50. 7 


11.8 




2 


17.8 


3.6 


15.7 


50.0 


10.8 




3 


16.7 


3.3 


11.1 


52. 0 


14 . 6 




4 


19.0 


4.0 


10.3 


53.1 


12. 3 




5 


19.7 


3.5 


9.0 


53. 6 


13.0 




6 


18.0 


2* '9. 


13.1 


52.8 


10.9 


307-1-1 


1 


14.4 


3.7 


11.2 


64.4 


6.3 




2 


15.4 


3.4 


7.8 


61.0 


11.3 




3 


17.2 


2. 5 


12. 0 


57.2 


11.1 


307-1-2 


1 


13.4 


3.0 


8.4 


55.4 


19.9 




2 


16.3 • 


3.1 


6.4 


55.7 


18.7 




3 


14.0 


3.3 


6.8 


58.7 


15.2 




4 


15.8 


2.5 


9.8 


59.7 


12.2 




5 


14.6 


3.7 


14.9 


51.1 


15.7 




6 


14.3 


3.9 


11.4 


55.5 


14.1 
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18; 0 


1B;1 


lti;2 


18:3 


307—1-3 


x 


14.8 


3.1 


9.4 


60.5 


12.2 






18.0 


3.0 


5.3 


56.2 


15.2 




3 


18.0 


3.4 


2.5 


58.6 


15.4 


307-1-4 


1 


15.0 


2.7 


13.8 


61.7 


6.9 




2 


15.9 


2.7 


9.8 


62.0 


9.6 




3 


14.6 


3.2 


13.4 


61.4 


6.7 


307—1-5 


1 


15.9 


3,5 


7.6 


61.7 


11.2 




2 


14.6 


3.5 


10.0 


61.3 


10.6 




3 


18.7 


2.6 


6.8 


53.0 


19.0 


3fl7— l —7 


1 


15.3 


3.5 


12.5 


60.3 


8.5 






16.2 


2.2 


13.9 


57.1 


10.6 






14.9 


3.1 


12.2 


58.0 


11.8 


"SAT— 1 —0 




16.4 


2.9 


23.2 


47.9 


9.6 




2 


19.6 


0.0 


20.4 


51.3 


8.8 




3 


16.6 


3.3 


24.6 


49.6 


5.7 


307-1-11 


1 


18.1 


3.6 


5.7 


52.9 


19.7 




2 


14.7 


3.7 


9.9 


58.7 


13.0 




3 


15.1 


3.7 


11.3 


55.8 


14.1 


The 


average 


18:3 


content 


of control 


embryos 


was 



14.5% with a range from 11.1% to 18.0%. The average 
18:3 content of transformed embryos was 11.5% with a 
5 range of 6.3% to 19.9%. Almost 80% of the transformed 
embryos (38/48) had an 18:3 content below that of the 
control mean. About 44% had an 18:3 content less than 
the lowest observed control value and 12.5% had an 18:3 
content less than half of the control mean value (i.e., 

10 less than 7.5%). The lowest 18:3 content observed in 
transformed tissue was 6.3% (299-1-3, 307-1-2 #1) 
compared with the control, low of 11.1%. In all cases in 
transformed tissue, a decrease in 18:3 content was 
reflected by an equivalent increase in 18:2 content 

15 indicating that the desaturation of 18:2 to 18:3 had 
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been reduced. The relative content of the the other 
fatty acids remained unchanged. 

Southern analysis for the presence of the intact/ 
introduced antisense construction was performed, as 
5 described in Example 12 using Bam HI cut gDNA, on a 
number of the transformed lines listed below using 
groups of embryos from a single transformation event . 
The approximate intact antisense copy number was 
estimated from the number and intensity of hybridizing 



bands on 


the autoradiograms and is 


shown in 


Table 13. 






TABLE 13 






Line No* 


Antisense 


18:3 


18:3 


18:2/18:3 


copy NO. 


.(low ) 




ratio 


2072 


0 


11.1 


14.5 


3.6 


303-7/1 


1 


11.4 


12.6 


4.7 


307-1/2 


3 


12.2 


16.0 


3.5 


306-4/8 


3 


10.8 


12.2 


4.3 


307-1/7 


4 


8.5 


10.3 


5.7 


306-4/5 


6 


7.4 


10.4 


5.8 


307-1/1 


6 


6.3 


9.6 


6.3 


299-15/1 


7 


8.1 


9*7 


6.1 


307-1/4 


8 


6.7 


7.7 


8.0 



There was a reasonable correlation between intact 
antisense copy number and 18:3 content, an increase in 
copy number correlating with a decreased 18:3 content 
and a consequent increase in the 18:2/18:3 ratio. The 
15 average 18:2/18:3 ratio of line 307-1/4., which had at 
least 8 copies of the antisense cDNA, was more than 
twice that of the control. 
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(i) APPLICANTS: Browse, John, Kinney, Anthony J., 
Pierce, John, Wierzbicki, Anna M., 
Yadav, Narendra S., Perex-Grau, Luis 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana IMMEDIATE SOURCE: 

(B) CLONE: pCF3 

(ix) FEATURE: 

<A) NAME/KEY: CDS 

(B) LOCATION: 46. ,1206 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CTCTCTCTCT CTCTCTTCTC TCTTTCTCTC CCCCTCTCTC CGGCG ATG GTT GTT 54 

Met Val val 
1 

GCT ATG GAC CAA CGC ACC AAT GTG AAC GGA GAT CCC GGC GCC GGA GAC 102 
Ala Met Asp Gin Arg Thr Asn Val Asn Gly Asp Pro Gly Ala Gly Asp 
5 10 15 

CGG AAG AAA GAA GAA AGG TTT GAT CCG AGT GCA CAA CCA CCG TTC AAG 150 
Arg Lys Lys Glu Glu Arg Phe Asp Pro Ser Ala Gin Pro Pro Phe Lys 
20 - 25 30 35 

ATC GGA GAT ATA AGG GCG GCG ATT CCT AAG CAC TGT TGG GTT AAG AGT 198 
lie Gly Asp lie Arg Ala Ala lie Pro Lys His Cys Trp Val Lys Ser 
40 4S 50 

CCT TTG AGA TCA ATG AGT TAC GTC GTC AGA GAC ATT ATC GCC GTC GCG 246 
Pro Leu Arg Ser Met Ser Tyr Val Val Arg Asp lie lie Ala val Ala 
55 60 65 

GCT TTG GCC ATC GCT GCC GTG TAT GTT GAT AGC TGG TTC CTT TGG CCT 294 
Ala Leu Ala lie Ala Ala Val Tyr Val Asp Ser Trp Phe Leu Trp Pro 
70 75 80 

CTT TAT TGG GCC GCC CAA GGA ACA CTT TTC TGG GCC ATC TTT GTT CTC 342 
Leu- Tyr Trp Ala Ala Gin Gly Thr Leu Phe Trp Ala lie Phe Val Leu 
85 go 95 

GGC CAC GAC TGT GGA CAT GGG AGT ,TTC TCA GAC ATT CCT CTA CTG AAT 390 
Gly His Asp Cys Gly His Gly Ser Phe Ser Asp He Pro Leu Leu Asn 
100 105 HO 115 
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AGT GTG GTT GGT CAC ATT CTT CAT TCT TTC ATC CTC GTT CCT TAG CAT 438 
Ser Val Val Gly His He Leu His Ser Phe lie Leu Val Pro Tyr His 
120 125 130 

GGT TGG AGA ATA AGC CAG CGG AGA CAC CAC CAG AAC CAT GGC CAT GTT 486 
Gly Trp Arg He Ser His Arg Thr His His Gin Asn His Gly His Val 
135 140 145 

GAA AAC GAC GAG TCA TGG GTT CGG TTA CCA GAA AGG GTG TAC AAG AAA 534 
Glu Asn Asp Glu Ser Trp Val Pro Leu Pro Glu Arg Val Ty* Lys 
ISO 155 160 

TTG CCC GAC AGT ACT CGG ATG CTC AGA TAC ACT GTC CCT CTC CCC ATG 582 
Leu Pro His Ser Thr Arg Met Leu Arg Tyr Thr Val Pro Leu Pro Met 
165 170 175 

CTC GCA TAT CCT GTC TAT TTG TGC TAC AGA AGT CCT GGA AAA GAA GGA 630 
Leu Ala Tyr Pro Leu Tyr Leu Cys Tyr Arg Ser Pro Gly Lys Glu Gly 
180 185 190 195 

TCA CAT TTT AAC GGA TAC AGT AGT TTA TTT GCT CCA AGC GAG AGA AAG 678 
Ser His Phe Asn Pro Tyr Ser Ser Leu Phe Ala Pro Ser Glu Arg Lys 
200 205 210 

CTT ATT GCA ACT TCA ACT ACT . TGT TGG TCC ATA ATG TTC GTC AGT CTT 726 
Leu He Ala Thr Ser Thr Thr Cys Trp Ser He Met Phe Val Ser Leu 
215 220 225 

ATC GCT CTA TGT TTC GTC TTC GGT CCA CTC GCG GTT CTT AAA GTG TAC 774 
He Ala Leu Ser Phe Val Phe Gly Pro Leu Ala Val Leu Lys VaX Tyr 
230 235 240 

GGT GTA CCG TAC ATT ATC TTT GTG ATG TGG TTG GAT GCT GTC ACG TAT 822 
Gly Val Pro Tyr He He Phe Val Met Trp Leu Asp Ala Val Thr Tyr 
245 250 255 

TTG CAT CAT CAT GGT CAC GAT GAG AAG TTG CCT TGG TAT AGA GGC AAG 870 
Leu His His His Gly His Asp Glu Lys Leu Pro Trp Tyr Arg Gly Lys 
260 265 270 275 

GAA TGG AGT TAT CTA GGT GGA GGA TTA ACA AGA ATT GAT AGA GAT TAC 918 
Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr He Asp Arg Asp Tyr 
280 285 290 

GGA ATC TTT AAC AAC ATT CAT . GAC GAC ATT GGA ACT CAC GTG ATC CAT 966 
Gly lie Phe Asn Asn He His His Asp He Gly Thr His Val He His 
295 300 305 

CAT CTC TTC CCA CAA ATC CCT CAC TAT CAC TTG GTC GAC GCC ACG AAA 1014 
His Leu Phe Pro Gin He Pro His Tyr His Leu Val Asp Ala Thr Lys 
310 315 * 320 

GCA GCT AAA CAT GTG TTG GGA AGA TAC TAC AGA GAA CCA AAG ACG TCA 1062 
Ala Ala Lys His Val Leu Gly Arg Tyr Tyr Arg Glu Pro Lys Thr Ser 
325 330 335 
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AAA GAT CAT TAG GTC AGC GAC ACT. GOT GAT ATT GTC TTC TAC GAG ACA 

Lvs Aao His Tvr Vai e~i- * _ u 1A( - «M» ACA 
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365 370 

IS £ £ S £ E £ S £ SS S E £ S a r ™ 

375 380 3as 

TTGTTTAGCT CTATTAGGAA TAAACCAGCC CACTTTTAAA ATTTTTATTT CTTGTTGTTT 
TTAAGTTAAA AGTGTACTCG TGAAACTCTT TTTTTTTTCT TTTTTTTTAT TAATGTATTT 
ACATTACAAG GCGTAAA 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERI S TICS : 

(A) LENGTH; 386 amino acids 

(B) TYPE; amino acid 
<D) TOPOLOGY; linear 

(xi) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Val Ala Met Asp Gin Arg Thr Asn Val Asn Gly Asp Pro Gly 

3 10 



15 



Ala Gly Asp Arg Lys Lys Glu Glu Arg Phe Asp Pro Ser Ala Gin Pro 

20 25 30 

Pro Phe Lys u. «, as P Ile Arg Ma Ma u- pr<> Ly? ^ ^ ^ 



Val Lys ser Pro Leu Arg Ser Met Ser Tyr Val Val Arg Asp He lie 

55 go 

Ala Val Ala Ala Leu Ala lie Ala Ala Val Tyr Val Asp Ser Trp Phe 

Leu trp Pro Leu Tyr Trp Ala Ala Gin Gly Thr Leu Phe Trp Ala lie 

85 9 ° 95 

Phe Vai Le U Um Asp Cyg Gly ^ ^ y ^ ^ ^ ^ ^ ^ 

Leu Leu Asn Ser Val Val Gly His lie Leu His Ser Phe lie Leu Val 

120 125 

Pro Tyr His Gly Trp Arg lie Ser His Arg Thr His His Gin Asn His 

135 140 



1273 
1333 
1350 
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Gly His Val Glu Asn Asp Glu Ser Trp Val Pro Leu Pro Glu Arg Val 
!45 150 155 160 

Tyr Lys Lys Leu Pro His Ser Thr Arg Met Leu Arg Tyr Thr Val Pro 
165 170 175 

Leu Pro Met Leu Ala Tyr Pro Leu Tyr Leu Cys Tyr Arg Ser Pro Gly 
1B0 185 190 

hys Glu Gly Ser His Phe Asn Pro Tyr Ser Ser Leu Phe Ala Pro Ser 
195 200 205 

Glu Arg Lys Leu lie Ala Thr Ser Thr Thr Cys Trp Ser lie Met Phe 
210 215 220 

Val Ser Leu lie Ala Leu Ser Phe Val Phe Gly Pro Leu Ala Val Leu 
225 230 235 240 

Lys Val Tyr Gly Val Pro Tyr lie lie Phe Val Met Trp Leu Asp Ala 
245 250 255 

Val Thr Tyr Leu His His His Gly His Asp Glu Lys Leu Pro Trp Tyr 
260 265 270 

Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr He Asp 
275 280 285 

Arg Asp Tyr Gly lie Phe Asn Asn lie His His Asp lie Gly Thr His 
290 295 300 

Val lie His His Leu Phe Pro Gin lie Pro His Tyr His Leu Val Asp 
305 310 315 320 

Ala Thr Lys Ala Ala Lys His Val Leu Gly Arg Tyr Tyr Arg Glu Pro 
325 330 335 

Lys Thr Ser Gly Ala He Pro He His Leu Val Glu Ser Leu Val Ala 
340 345 350 

Ser lie Lys hys Asp His Tyr Val Ser Asp Thr Gly Asp lie Val Phe 
355 360 365 

Tyr Glu Thr Asp Pro Asp Leu Tyr Val Tyr Ala Ser Asp Lys Ser Lys 
370 375 380 

lie Asn 
385 

(2) . INFORMATION FOR SEQ ID N0:3:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: pFl 
(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 68.. 255 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AAATTCATCA AACCCTTTCT TCACCACATT ATTTTCACTG AGCGCATAAC ATTTTTGAGA 60 

CAAGAGACTC TCTCTCTCTC TCTTCTCTCT TTCTCTCCCC CTCTCTCCGG CGATGGTTGT 120 

TGCTATGGAC CAACGCACCA ATGTGAACGG AGATCCCGGC GCCGGAGACC GGAAGAAAGA 180 

AGAAAGGTTT GATCCGAGTG CACAACCACC GTTCAAGATC GGAGATATAA GGGCGGCGAT 24 0 

TCCTAAGCAC TGTTG 255 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1525 base pairs 

KB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: pACF2-2 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 10.. 1350 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CAAGTTCTA ATG GCG AAC TTG GTC TTA TCA GAA TGT GGT ATA CGA CCT 48 
Met Ala Asn Leu Val Leu Ser Glu Cys Gly lie Arg Pro 
1 5 1° 

CTC CCC AGA ATC TAC ACA ACA CCC AGA TCC AAT TTC CTC TCC AAC AAC 
Leu Pro Arg lie Tyr Thr Thr Pro Arg Ser Asn Phe Leu Ser Asn Asn 
15 20 25 

AAC AAA TTC AGA CCA TCA CTT TCT TCT TCT TCT TAC AAA ACA TCA TCA 
Asn Lys Phe Arg Pro Ser Leu Ser Ser Ser Ser Tyr Lys Thr Ser Ser 
30 35 40 45 

192 



95 



96 



144 



TCT CCT CTG TCT ITT GGT CTG AAT TCA CGA GAT GGG TTC ACG AGG AAT 
Ser Pro Leu Ser Phe Gly Leu Asn Ser Arg Asp Gly Phe Thr Arg Asn 
SO 55 »<» 

TGG GCG TTG AAT GTG AGC ACA CCA TTA ACG ACA CCA ATA TTT GAG GAG 
Tip Ala Leu Ash Val Ser Thr Pro Leu Thr Thr Pro lie Phe Glu Glu 
65 70 75 

TCT CCA TTG GAG GAA GAT AAT AAA CAG AGA TTC GAT CCA GGT GCG CCT 
Ser Pro Leu Glu Glu Asp Asn Lys Gin Arg Phe Asp Pro Gly Ala Pro 
80 85 90 

CCT CCG TTC AAT TTA GCT GAT ATT AGA GCA GCT ATA CCT AAG CAT TGT 336 
Pro Pro Phe Asn Leu Ala Asp He Arg Ala Ala lie Pro Lys His Cys 

100 105 



240 



286 



384 



432 



480 



TGG GTT AAG AAT CCA TGG AAG TCT TTG AGT TAT GTC GTC AGA GAC GTC 
Trp Val Lys Asn Pro Trp Lys Ser Leu Ser Tyr Val Val Arg Asp Val 
1W 115 120 125 

GCT ATC GTC TTT GCA TTG GCT GCT GGA GCT GCT TAC CTC AAC AAT TGG 
Ala He Val Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu Asn Asn Trp 
130 135 140 

ATT GTT TGG CCT CTC TAT TGG CTC GCT CAA GGA ACC ATG TTT TGG GCT 
He Val Trp Pro Leu Tyr Trp Leu Ala Gin Gly Thr Met Phe Trp Ala 
145 150 155 

CTC TTT GCT CTT GGT CAT GAC TGT GGA CAT GGT AGT TTC TCA AAT GAT 528 
Leu Phe Vai Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asn Asp 
160 I" 170 

CCG AAG TTG AAC AGT GTG GTC GGT CAT CTT CTT CAT TCC TCA ATT CTG 576 
Pro Lys Leu Asn Ser Val Val Gly His Leu Leu His Ser Ser lie Leu 
175- 180 I 85 

GTC CCA TAC CAT GGC TGG AGA ATT AGT CAC AGA ACT GAC CAC CAG AAC 624 
val Pro Tyr His Gly Trp Arg lie Ser His- Arg Thr Rxs His Gin Asn 
190 195 200 205 

CAT GGA CAT GTT GAG AAT GAC GAA TCT TGG CAT CCT ATG TCT GAG AAA 672 
His Gly His Val Glu Asn Asp Glu Ser Trp His Pro Met Ser Glu Lys 
210 215 220 
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ATC TAC AAT ACT TTG GAC AAG CCG ACT AG A TTC TTT AGA TTT ACA CTG 720 
He Tyr Asn Thr Leu Asp Lys Pro Thr Arg Phe Phe Arg Phe Thr Leu 
225 230 235 

CCT CTC GTG ATG CTT GCA TAC CCT TTC TAC TTG TGG GCT CGA AGT CCG 768 
Pro Leu Val Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala Arg Ser Pro 
240 245 250 

GGG AAA AAG GGT TCT CAT TAC CAT CCA GAC AGT GAC TTG TTC CTC CCT 616 
Gly Lys Lys Gly Ser His Tyr His Pro Asp Ser Asp Leu Phe Leu Pro 
255 260 265 



AAA GAG AGA AAG GAT GTC CTC ACT TCT ACT GCT TGT TGG ACT GCA ATG 864 
Lys Glu Arg Lys Asp Val Leu Thr Ser Thr Ala Cys Trp Thr Ala Met 
270 275 280 285 

GCT GCT CTG CTT GTT TGT CTC AAC TTC ACA ATC GGT CCA ATT CAA ATG 912 
Ala Ala Leu Leu Val Cys Leu Asn Phe Thr He Gly Pro lie Gin Met 
290 295 300 

CTC AAA CTT TAT GGA ATT CCT TAC TGG ATA AAT GTA ATG TGG TTG GAC 960 
* Leu Lys Leu Tyr Gly He Pro Tyr Trp He Asn Val Met Trp Leu Asp 
305 310 315 



TTT GTG ACT TAC CTG CAT CAC CAT GGT CAT GAA GAT AAG CTT CCT TGG 1008 
Phe Val Thr Tyr Leu His His His Gly His Glu Asp Lys Leu Pro Trp 
320 325 330 

TAC CGT GGC AAG GAG TGG AGT TAC CTG AGA GGA GGA CTT ACA ACA TTG 1056 
Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Leu 
335 340 345 

GAT CGT GAC TAC GGA TTG ATC AAT AAC ATC CAT CAT GAT ATT GGA ACT 1104 
Asp Arg Asp Tyr Gly Leu He Asn Asn lie His His Asp lie Gly Thr 
350 * 355 360 365 

CAT GTG ATA CAT CAT CTT TTC CCG CAG ATC CCA CAT TAT CAT CTA GTA 1152 
His Val He His His Leu Phe Pro Gin He Pro His Tyr His Leu Val 
370 375 380 

GAA GCA ACA GAA GCA GCT AAA CCA GTA TTA GGG AAG TAT TAC AGG GAG 1200 
Glu Ala Thr Glu Ala Ala Lys Pro Val Leu Gly. Lys Tyr Tyr Arg Glu 
385 390 395 

CCT GAT AAG TCT GGA CCG TTG CCA TTA CAT TTA CTG GAA ATT CTA GCG 1248 
Pro Asp Lys Ser Gly Pro Leu Pro Leu His Leu Leu Glu He Leu Al 
400 405 410 

AAA AGT. ATA AAA GAA GAT CAT TAC GTG AGC GAC GAA GGA GAA GTT GTA 1296 
Lys Ser He Lys Glu Asp His Tyr Val Ser Asp Glu Gly Glu Val Val 
-415 420 * 425 

TAC TAT AAA GCA GAT CCA AAT CTC TAT GGA GAG GTC AAA GTA AGA GCA 1344 
Tyr Tyr Lys Ala Asp Pro Asn Leu Tyr Gly Glu Val Lys Val Arg Ala 
430 435 440 445 



GAT TGAAATGAAG CAGGCTTGAG ATTGAAGTTT TTTCTATTTC AGACCAGCTG 
Asp 



1397 
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ATTTTTTGCT TACTGTATCA ATTTATTGTG TCACCCACCA GAGAGTTAGT ATCTCTGAAT 1457 
ACGATCGATC AGATGGAAAC AACAAATTTG TTTGCGATAC TGAAGCTATA TATACCATAC 1517 
ATTGCATT 1525 

(2) INFORMATION FOR SEQ ID NO: 5: 

{i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 446 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Asn Leu Val Leu Ser Glu Cys Gly lie Arg. Pro Leu Pro Arg 
X 5 10 15 

lie Tyr Thr Thr Pro Arg Ser Asn Phe Leu Ser Asn Asn Asn Lys Phe 
20 25 30 

Arg Pro Ser Leu Ser Ser Ser Ser Tyr Lys Thr Ser Ser Ser Pro Leu 
35 40 45 

Ser Phe Gly Leu Asn Ser Arg Asp Gly Phe Thr Arg Asn Trp Ala Leu 
50 55 €0 

Asn Val Ser Thr Pro Leu Thr Thr Pro He Phe Glu Glu Ser Pro Leu 
65 70 75 80 

Glu Glu Asp Asn Lys Gin Arg Phe Asp Pro Gly Ala Pro Pro Pro Phe 
85 90 95 

Asn Leu Ala Asp lie Arg Ala Ala lie Pro Lys His Cys Trp Val Lys 
100 105 110 

Asn Pro Trp Lys Ser Leu Ser Tyr Val Val Arg Asp Val Ala He Val 
115 120 125 

Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu Asn Asn Trp lie Val Trp 
130 135 140 

Pro Leu Tyr Trp Leu Ala Gin Gly Thr Met Phe Trp Ala Leu Phe Val 
145 150 155 160 

Leu* Gly Jiis Asp Cys Gly His Gly Ser Phe Ser Asn Asp Pro Lys Leu 
165 170 175 

Asn Ser Val Val Gly His Leu Leu His Ser Ser He Leu Val Pro Tyr 
180 185 190 

His Gly Trp Arg He Ser His Arg Thr His His Gin Asn His Gly His 
195 200 205 
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Vai Glu Asn Asp Glu Ser Trp His Pro Met Ser Glu Lys He Tyr Asn 
210 215 220 

Thr Leu Asp Lys Pro Thr Arg Phe Phe Arg Phe Thr Leu Pro Leu Val 
225 230 235 240 

Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala Arg Ser Pro Gly Lys Lys 
245 250 255 

Gly Ser His Tyr His Pro Asp Ser Asp Leu Phe Leu Pro Lys Glu Arg 
260 265 270 

Lys Asp Val Leu Thr Ser Thr Ala Cys Trp Thr Ala Met Ala Ala Leu 
275 280 285 

Leu Val Cys Leu Asa Phe Thr He Gly Pro He Gin Met Leu Lys Leu 
290 295 300 

Tyr Gly He Pro Tyr Trp He Asn Val Met Trp Leu Asp Phe Val Thr 
305 310 315 320 

Tyr Leu His His His Gly His Glu Asp Lys Leu Pro Trp Tyr Arg Gly 
325 330 335 

Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Leu Asp Arg Asp 
340 345 350 

Tyr Gly Leu He Asn Asn He His His Asp lie Gly Thr His Val He 
355 360 365 

His His Leu Phe Pro Gin He Pro His Tyr His Leu Val Glu Ala Thr 
370 375 380 

Glu Ala Ala Lys Pro Val Leu Gly Lys Tyr Tyr Arg Glu Pro Asp Lys 
385 390 395 400 

Ser Gly Pro Leu Pro Leu His Leu Leu Glu He Leu Ala Lys Ser He 
405 410 415 

Lys Glu Asp His Tyr Val Ser Asp Glu Gly Glu Val Val Tyr Tyr Lys 
420 425 430 



Ala Asp Pro Asn Leu Tyr Gly Glu Val Lys Val Arg Ala Asp 
435 440 445 



(2) INFORMATION FOR SEQ ID NO: 6: 

"(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1429 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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( iii ) HYPOTHETICAL : NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Brassica napus 
(vii) IMMEDIATE SOURCE: 

<B) CLONE : pBNSF3 -f 2 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 79 . . 1212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TTCAAATTCA GACAATCCCC TTCTTCTCCC CGGTTTCGTC TGAACTCTCG AAACTGGGCG 60 

TTGAATGTAA CCACACCT CTA ACA GTC GAC TCC TCA TCA TCT CCT CCA ATC 111 
Leu Thr Val Asp Ser Ser Ser Ser Pro Pro lie 
15 10 



GAG GAA GAA CCC AAA ACG CAG AGA TTC GAC CCA GGC GCT CCT CCT CCG 
Glu Glu Glu Pro Lys Thr Glh Arg Phe Asp Pro Gly Ala Pro Pro Pro 
15 20 25 

TTC AAC CTA GCT GAC ATC AGA GCG GCG ATA CCT AAG CAT TGC TGG GTT 
Phe Asn Leu Ala Asp lie Arg Ala Ala He Pro Lys His Cys Trp Val 
30 35 40 

AAG AAT CCA TGG AAG TCT ATG AGT TAC GTC GTC AGA GAG CTA GCC ATC 
Lys Asn Pro Trp Lys Ser Met Ser Tyr Val Val Arg Glu Leu Ala He 
45 50 55 

GTG TTC GCA CTA GCT GCT GGA GCT GCT TAC CTC AAC AAT TGG CTT GTT 
Val Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu Asn Asn Trp Leu Val 
£0 65 70 75 

TGG CCT CTC TAT TGG ATT GCT GAA GGA ACC ATG TTC TGG GCT CTC TTT 
Trp Pro Leu Tyr Trp He Ala Gin Gly Thr Met Phe Trp Ala Leu Phe 
80 85 90 

GTT CTT GGC CAT GAC TGT GGA CAT GGA AGC TTC TCA AAT GAT CCG AGG 
Val Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asn Asp Pro Arg 
95 100 105 

TTG AAC AGT GTG GTG GGT CAC CTT CTT CAT TCC TCT ATT CTA GTC CCT 
Leu Asn Ser Val Val Gly His Leu Leu His Ser Ser He Leu Val Pro 
110 115 120 

TAC CAT GGC TGG AGA ATT AGC CAC AGA ACT CAC CAC CAG AAC CAT GGA 
Tyr His Gly Trp Arg He Ser His Arg Thr His His Gin Asn His Gly 
125 130 135 

CAT GTT GAG AAC GAT GAA TCT TGG CAT CCT ATG TCT GAG AAA ATC TAC 
His Val Glu Asn Asp Glu Ser Trp His Pro Met Ser Glu Lys He Tyr 
140 145 150 155 



159 



207 



255 



303 



351 



399 



447 



495 



543 
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AAG ACT TTG GAC AAA.CCC ACT CGG TTC TTT AGA TTT ACA TTG CCT CTC 591 
Lys Ser Leu Asp Lys Pro Thr Arg Phe Phe Arg Phe Thr Leu Pro Leu 
160 165 170 

GTG ATG CTC GCT TAC CCT TTC TAC TTG TGG GCA AGA AGT CCA GGG AAG 639 
Val Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala Arg Ser Pro Gly Lys 
175 180 185 

AAG GGT TCT CAT TAC CAT CCA GAC AGC GAC TTG TTC CTT CCT AAA GAG 687 
Lys Gly Ser His Tyr His Pro Asp Ser Asp Leu Phe Leu Pro Lys Glu 
190 195 200 

AGA AAC GAT GTT CTC ACT TCT ACC GCT TGT TGG ACT GCA ATG GCT GTT 735 
Arg Asn Asp Val Leu Thr Ser Thr Ala Cys Trp Thr Ala Met Ala Val 
205 210 215 

CTG CTT GTC TGT CTC AAC TTC GTG ATG GGT CCA ATG CAA ATG CTC AAA 783 
Leu Leu Val Cys Leu Asn Phe Val Met Gly Pro Met Gin Met Leu Lys 
220 225 230 235 

CTT TAT GTC ATT CCT TAG TGG ATA AAT GTA ATG TGG TTG GAC TTT GTG 831 
Leu Tyr Val lie Pro Tyr Trp lie Asn Val Met Trp Leu Asp Phe Val 
240 245 250 

ACT TAC CTG CAT CAC CAT GGT CAT GAA GAT AAG CTC CCT TGG TAC CGT 879 
Thr Tyr Leu His His His Gly His Glu Asp Lys Leu Pro Trp Tyr Arg 
255 260 265 

GGG AAG GAA TGG AGT TAC TTG AGA GGA GGA CTT ACA ACA TTG GAC CGG 927 
Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Leu Asp Arg 
270 275 280 

GAC TAC GGA TTG ATC AAC AAC ATC CAT CAC GAC ATT GGA ACT CAT GTG 975 
Asp Tyr Gly Leu lie Asn Asn lie His His Asp He Gly Thr His Val 
285 290 295 

ATA CAT CAT CTT TTC CCT CAG ATC CCA CAT TAT CAT CTA GTA GAA GCA 1023 
•lie His His Leu Phe Pro Gin He Pro His Tyr His Leu Val Glu Ala 
300 305 310 315 

ACA GAA GCA GCT AAA CCA GTA TTA GGG AAG TAT TAT AGG GAG CCT GAT 1071 
Thr Glu Ala Ala Lys Pro Val Leu Gly Lys Tyr Tyr Arg Glu Pro Asp 
320 325 330 

AAG TCT GGA CCT TTG CCA TTA CAT TTA CTG GGA ATC TTA GCA AAA AGT 1119 
Lys Ser Gly Pro Leu Pro Leu His Leu Leu Gly lie Leu Ala Lys Ser 
335 340 345 

ATT AAA GAA GAT CAT TTT GTG AGC GAT GAA GGA GAT GTT GTA TAC TAT 1167 
lie Lys Glu Asp His Phe Val Ser Asp Glu Gly Asp Val Val Tyr Tyr 
350 355 360 

GAA GCA GAC CCT AAT CTC TAT GGA GAG ATC AAG GTA ACA GCA GAG 1212 
Glu Ala Asp Pro Asn Leu Tyr Gly Glu He Lys Val Thr Ala Glu 
365 370 375 

TGAAATGAAG CTGTCAGATT TATCTATTTC TGACCAGCTG ATTTTTTTTG CTTATTAATG 1272 
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TCAATTCATT GTGTTACCAT TATCTCTGAA TACAA7CAGA TGGAAACCCC AACTTTGTTT 1332 
TCAATACTTG AAGCTATATA TATATATATA TATGTAAGAT ACATTGTATT GTCATTAGAT 1392 
TCACCATTCT CAAGGTTCTT ATACAAAAAA AAAAAAA 1429 J 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 amino acids 

(B) TYPE: amino acid 
(D ) TOPOLOGY : linear 

<ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Leu Thr Val Asp Ser Ser Ser Ser Pro Pro He Glu Glu Glu Pro Lys 
1 5 10 15 

Thr Gin Arg; Phe Asp Pro Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp 
20 25 30 

He Arg- Ala Ala He Pro Lys His Cys Trp Val Lys Asn Pro Trp Lys 
35 40 45 

Ser Met Ser Tyr Val Val Arg Glu Leu Ala He Val Phe Ala Leu Ala 
50 55 60 

Ala Gly Ala Ala Tyr Leu Asn Asn Trp Leu Val Trp Pro Leu Tyr Trp 
65 '70 75 80 

He Ala Gin Gly Thr Met Phe Trp Ala Leu Phe Val Leu Gly His Asp 
85 30 95 

Cys Gly His Gly Ser Phe Ser Asn Asp Pro Arg Leu Asn Ser Val Val 
100 105 HO 

Gly His Leu Leu His Ser Ser He Leu Val Pro Tyr His Gly Trp Arg 
115 120 125 

lie Ser His Arg Thr His His Gin Asn His Gly His Val Glu Asn Asp 
130 135 140 

Glu Ser Trp His Pro Met Ser Glu Lys He Tyr Lys Ser Leu Asp Lys 
145 150 155 160 

Pro" Thr Arg Phe Phe Arg Phe Thr Leu Pro Leu Val Met Leu Ala Tyr 
165 170 175 

Pro Phe Tyr Leu Trp Ala Arg Ser Pro Gly Lys Lys Gly Ser His Tyr 
180 185 190 

His Pro Asp Ser Asp Leu Phe Leu Pro Lys Glu Arg Asn Asp Val Leu 
195 200 205 
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Thr Ser Thr Ala Cys Trp Thr Ala Met Ala Val Leu Leu Val Cys Leu 
210 215 220 

Asn Phe Val Met Gly Pro Met Gin Met Leu Lys Leu Tyr Val He Pro 
225 230 235 240 

Tyr Trp He Asn Val Met Trp Leu Asp Phe Val Thr Tyr Leu His His 
245 250 255 

His Gly His Glu Asp Lys Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser 
260 265 270 

Tyr Leu Arg Gly Gly Leu Thr Thr Leu Asp Arg Asp Tyr Gly Leu He 
275 280 285 

Asn Asn lie His His Asp He Gly Thr His Val He His His Leu Phe 
290 295 300 

Pro Gin He Pro His Tyr His Leu Val Glu Ala Thr Glu Ala Ala Lys 
305 310 315 320 

Pro Val Leu Gly Lys Tyr Tyr Arg Glu Pro Asp Lys Ser Gly Pro Leu 
325 330 335 

Pro Leu His Leu Leu Gly He Leu Ala Lys Ser lie Lys Glu Asp His 
340 345 350 

Phe Val Ser Asp Glu Gly Asp Val Val Tyr Tyr Glu Ala Asp Pro Asn 
355 360 365 

Leu Tyr Gly Glu He Lys Val Thr Ala Glu 
370 375 



(2) INFORMATION FOR SEQ ID N0:8: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1429 base pairs 

(B) TYPE; nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDHA 
(iii) HYPOTHETICAL: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: pBNSFd-2 



WO 93/11245 PCT/US92/10284 

140 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: I.. 1215 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: £ 

TTC AAA TTC AGA CAA TCC CCT TCT TCT CCC CGG TTT CGT CTG AAC TCT 48 
Phe Lys Phe Arg Gin Ser Pro Ser Ser Pro Arg Phe Arg Leu Asn Ser 
1 5 10 15 

CGA AAC TGG GCG TTG AAT GTA ACC ACA CCT CTA ACA GTC GAC TCC TCA 96 
Arg Asn Trp Ala Leu Asn Val Thr Thr Pro Leu Thr Val Asp Ser Ser 
20 25 30 

TCA . TCT CCT CCA ATC GAG GAA GAA CCC AAA ACG CAG AGA TTC GAC CCA 144 
Ser Ser Pro Pro lie Glu Glu Glu Pro Lys Thr Gin Arg Phe Asp Pro 
35 40 45 

GGC GCT CCT CCT CCG TTC AAC CTA GCT GAC ATC AGA GCG GCG ATA CCT 192 
Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp lie Arg Ala Ala lie Pro 
50 55 60 

AAG CAT TGC TGG GTT AAG AAT CCA TGG AAG TCT ATG AGT TAC GTC GTC 240 
Lys His Cys Trp Val Lys Asn Pro Trp Lys Ser Met Ser Tyr Val Val 
65 70 75 80 

AGA GAG CTA GCC ATC GTG TTC GCA CTA GCT GCT GGA GCT GCT TAC CTC 
Arg Glu Leu Ala lie Val Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu 
85 90 95 



TCT ATT CTA GTC CCT TAC CAT GGC TGG AGA ATT AGC CAC AGA ACT CAC 
Ser lie Leu Val Pro Tyr His Gly Trp Arg lie Ser His Arg Thr His 
145 150 155 160 



TTT ACA TTG CCT CTC GTG ATG CTC GCT TAC CCT TTC TAC TTG TGG GCA 
Phe Thr Leu Pro Leu Val Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala 
195 200 205 



288 



AAC AAT TGG CTT GTT TGG CCT CTC TAT TGG ATT GCT CAA GGA ACC ATG 336 
Asn Asn Trp Leu Val Trp Pro Leu Tyr Trp lie Ala Gin Gly Thr Met 
100 105 HO 

TTC TGG GCT CTC TTT GTT CTT GGC CAT GAC TGT GGA CAT GGA AGC TTC 384 
Phe Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe 
115 120 125 

TCA AAT GAT CCG AGG TTG AAC AGT GTG GTG GGT CAC CTT CTT CAT TCC 432 
Ser Asn Asp Pro Arg Leu Asn Ser Val Val Gly His Leu Leu His Ser 
130 135 140 



480 



CAC CAG AAC CAT GGA CAT GTT GAG AAC GAT GAA TCT TGG CAT CCT ATG 528 
His Gin Asn His Gly Hia Val Glu Asn Asp Glu Ser Trp His Pro Met 
165 . 170 175 

TCT GAG AAA ATC TAC AAG AGT TTG GAC AAA CCC ACT CGG TTC TTT AGA 576 
Ser Glu Lys He Tyr Lys Ser Leu Asp Lys Pro Thr Arg Phe Phe Arg 
180 185 190 



624 
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AGA ACT CCA GGG AAG AAG GGT TCT CAT TAC CAT CCA. GAC AGC GAC TTG 672 
Arg Ser Pro Gly Lys Lys Gly Ser His Tyr His Pro Asp Ser Asp Leu 
210 215 220 

TTC CTT CCT AAA GAG AGA AAC GAT GTT CTC ACT TCT ACC GCT TGT TGG 720 
Phe Leu Pro Lys Glu Arg Asn Asp Val Leu Thr Ser Thr Ala Cys Trp 
225 230 235 240 

ACT GCA ATG GCT GTT GTG CTT GTC TGT CTC AAC TTC GTG ATG GGT CCA 769 
Thr Ala Met Ala Val Leu Leu Val Cys Leu Asn Phe Val Met Gly Pro 
245 250 255 

ATG CAA ATG CTC AAA CTT TAT GTC ATT CCT TAC TGG ATA AAT GTA ATG 816 
Met Gin Met Leu Lys Leu Tyr Val He Pro Tyr Trp He Asa Val Met 
260 265 270 

TGG TTG GAC TTT GTG ACT TAC CTG CAT CAC GAT GGT CAT GAA GAT AAG 864 
Trp Leu Asp Phe Val Thr Tyr Leu His His His Gly His Glu Asp Lys 
275 280 285 

CTC CCT TGG TAC CGT GGG AAG GAA TGG AGT TAC TTG AGA GGA GGA CTT 912 
Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu 
290 295 300 

ACA ACA TTG GAC CGG GAC TAC GGA TTG ATC AAC AAC ATC CAT CAC GAC 960 
Thr Thr Leu Asp Arg Asp Tyr Gly Leu lie Asn Asn He His His Asp 
305 310 315 320 

ATT GGA ACT CAT* GTG ATA CAT CAT CTT TTC CCT CAG ATC CCA CAT TAT 1008 
He Gly Thr His Val He His His Leu Phe Pro Gin lie Pro His Tyr 
325 330 335 

CAT CTA GTA GAA GCA ACA GAA GCA GCT AAA CCA GTA TTA GGG AAG TAT .1056 
His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Leu Gly Lys Tyr 
340 345 350 

TAT AGG GAG CCT GAT AAG TCT GGA CCT TTG CCA TTA CAT TTA CTG GGA 1104 
Tyr Arg Glu Pro Asp Lys Ser Gly Pro Leu Pro Leu His Leu Leu Gly 
355 360 365 

ATC TTA GCA AAA AGT ATT AAA GAA GAT CAT TTT GTG AGC GAT GAA GGA 1152 
He Leu Ala Lys Ser lie Lys Glu Asp His Phe Val Ser Asp Glu Gly 
370 375 380 

GAT GTT GTA TAC TAT GAA GCA GAC CCT AAT CTC TAT GGA GAG ATC AAG 1200 
Asp Val Val Tyr Tyr Glu Ala Asp Pro Asn Leu Tyr Gly Glu He Lys 
385 390 395 400 

GTA ACA GCA GAG TGAAATGAAG CTGTCAGATT TATCTATTTC TGACCAGCTG 1252 
Vai Thr Ala Glu 

405 

ATTTTTTTTG QTTATTAATG TCAATTCATT GTGTTACCAT TATCTCTGAA TACAATCAGA 1312 

TGGAAACCCC AACTTTGTTT TCAATACTTG AAGCTATATA TATATATATA TATGTAAGAT 1372 

ACATTGTATT GTCATTAGAT TCACCATTCT CAAGGTTCTT ATACAAAAAA AAAAAAA 1429 
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(2) INFORMATION FOR SEQ ID NO: 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECOLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Phe Lys Phe Arg Glri Ser Pro Ser Ser Pro Arg Phe Arg Leu Asn Ser 
1 5 10 15 

Arg Asn Trp Ala Leu Asn Val Thr Thr Pro Leu Tfar Vai Asp Ser Ser 
20 25 30 

Ser Ser Pro Pro lie Glu Glu Glu Pro Lys Thr Gin Arg Phe Asp Pro 
35 40 45 

Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp lie Arg Ala Ala lie Pro 
50 55 60 

Lys His Cys Trp Val Lys Asn Pro Trp Lys Ser Met Ser Tyr Val Val 
65 70 75 80 

Arg Glu Leu Ala lie Val Phe Ala Leu Ala Ala Gly Ala Ala Tyr Leu 
85 90 95 

Asn Asn Trp Leu Val Trp Pro Leu Tyr Trp lie Ala Gin Gly Thr Met 
100 105 110 

Phe Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe 
115 120 125 

Ser Asn Asp Pro Arg Leu Asn Ser Val Val Gly His Leu Leu His Ser 
130 135 140 

Ser lie Leu Val Pro Tyr His Gly Trp Arg lie Ser His Arg Thr His 
145 150 155 160 

His Gin Asn His Gly His Val Glu Asn Asp Glu Ser Trp His Pro Met 
165 170 175 

Ser Glu Lys He Tyr Lys Ser Leu Asp Lys Pro Thr Arg Phe Phe Arg 
180 185 190 

Phe Thr Leu Pro Leu Val Met Leu Ala Tyr Pro Phe Tyr Leu Trp Ala 
195 200 205 

Arg Ser Pro Gly Lys . Lys Gly Ser His Tyr His Pro Asp Ser Asp Leu 
210 215 220 

Phe Leu Pro Lys Glu Arg Asn Asp Val Leu Thr Ser Thr Ala Cys Trp 
225 230 235 240 
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Thr Ala Met Ala Val Leu Leu Val Cys Leu Asn Phe Val Met Gly Pro 
245 250 255 

Met Gin Met Leu Lys Leu Tyr Val He Pro Tyr Trp He Asn Val Met 
260 265 270 

Trp Leu Asp Phe Val Thr Tyr Leu His His His Gly His Glu Asp Lys 
275 280 285 

Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu 
290 295 300 

Thr Thr Leu Asp Arg Asp Tyr Gly Leu lie Asn Asn He His His Asp 
305 3X0 315 320 

lie Gly Thr His Val lie His His Leu Phe Pro Gin He Pro His Tyr 
325 330 335 

His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Leu Gly Lys Tyr 
340 345 350 

Tyr Arg Glu Pro Asp Lys Ser Gly Pro Leu Pro Leu His Leu Leu Gly 
355 360 365 

lie Leu Ala Lys Ser He Lys Glu Asp His Phe Val Ser Asp Glu Gly 
370 375 380 

Asp Val Val Tyr Tyr Glu Ala Asp Pro Asn Leu Tyr Gly Glu He Lys 
385 390 395 400 

Val Thr Ala Glu 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2181 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(iij MOLECULE TYPE: CDNA 
(iit) HYPOTHETICAL: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Glycine max 
(vii) IMMEDIATE SOURCE : 

(B) CLONE: pXFl 
(ix) FEATURE; 

(A) NAME/KEY: CDS 
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(B) LOCATION: 855. .1997 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ACAATAATAA ATCCATATTT TTATAATTAA AAGTAGTAGA TTACAGCGAT GCACTTGAGA 60 

AACATATTAA GTGGACTAAT TCTCCCTGGT CAAGCAAGAA AAAAACCAGC TATGACCCAA 120 

GGTAGAGAGA GATTATAGAC AGAATACTAG TAATTAACTA AGACTGGCTC TGCAATTGCC 180 

AAAAACTCCA TTGCAGTAGC AGCCACCTGA GAAGACACTA AGACCTAGAC TAGACCATAC 240 

ATATGAAGAT TAATACGCTT ACATAACAAC ATAGGACACT AAGAAAACAC GGCTTACAGA 300 

GAATCCAGCT GACTCTATAA GAGGGGTACT TCTGGAGATT AAAATTATCC GAATCACCTT 360 

CCCACTGCGG CTGCTGACGT CAGCGAAAGT CAGAACCGAA AGCGGCGAAG AACCTTCAGA 420 

AGAGGAGGAA GCACTTCGAC CTTACAAGAG TTGTTGTCGT TGTTGTTGTC GTTCTCTGGC 480 

GGAGAAGCGA GTTTGGATCG CGTTTTCCTC GGAGGCTTCT CGGTCTTCCC CTGTTTCTGC 540 

AGCTCAGCCA GGCCCTCGCA AATGGCCTGA AGCTTGGCGT CAACGGCGGA ATGAAGAGGC 600 

TAATACTCCC CGAAGTCACC ACCGACGGAG GAACCCTGGT GTCGGAGGTT GGGGAAGTTG 660 

AGCCTGGCGA AGTCACCTCG GAGCTTGTAC GCGGCCTTGT GGTACGCCAG AGCGGCTTCC 720 

TCGGCGGTGT CGAAGGTTCC CAGCCATAGC CTGGTCCGGA TTCTTCGGGA GTCTAATCTC 780 

AGCCACCCAC TTCCCCCCTG AGAAAAGAGA GGAACCACAC TCTCTAAGCC AAAGCAAAAG 840 

CAGCAGCAGC AGCA ATG GTT AAA GAC ACA AAG CCT TTA GCC TAT GCT GCC 
Met Val Lys Asp Thr Lys Pro Leu Ala Tyr Ala Ala 
1 5 10 



890 



AAT AAT GGA TAC CAA CAA AAG GGT TCT TCT TTT GAT TTT GAT CCT AGC 938 
Asn Asn Gly Tyr Gin Gin Lys Gly Ser Ser Phe Asp Phe Asp Pro Ser 
15 20 25 

GCT CCT CCA CCG TTT AAG ATT GCA GAA ATC AGA GCT TCA ATA CCA AAA 986 
Ala Pro Pro Pro Phe Lys lie Ala Glu Tie Arg Ala Ser He Pro Lys 
30 35 40 

CAT TGC TGG GTC AAG AAT CCA TGG AGA TCC CTC AGT TAT GTT CTC AGG 1034 
His Cys Trp Val Lys Asn Pro Trp Arg Ser Leu Ser Tyr Val Leu Arg 
45 50 55 60 

GAT GTG CTT GTA ATT GCT GCA TTG GTG GCT GCA GCA ATT CAC TTC GAC 1082 
Asp Val Leu Val lie Ala Ala Leu Val Ala Ala Ala lie His Phe Asp 
65 * 70 75 

AAC TGG CTT CTC TGG CTA ATC TAT TGC CCC ATT CAA GGC ACA ATG TTC 1130 
Asn Trp Leu Leu Trp Leu He Tyr Cys Pro He Gin Gly Thr Met Phe 
80 85 90 
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TGG GCT CTC TTT GTT CTT GGA CAT GAT TGT GGC CAT GGA AGC TTT TCA 1178 
Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe Ser 
95 100 105 

GAT AGC CCT TTG CTG AAT AGC CTG GTG GGA CAC ATC TTG CAT TCC TCA 1226 
Asp Ser Pro Leu Leu Asn Ser Leu Val Gly Mis lie Leu His Ser Ser 
110 115 120 

ATT CTT GTG CCA TAC CAT GGA TGG AGA ATT AGC CAC AGA ACT CAC CAT 1274 
lie Leu Val Pro Tyr His Gly Trp Arg lie Ser His Arg Thr His His 
125 130 135 140 

GAA AAC CAT GGA CAC ATT GAG AAG GAT GAG TCA TGG GTT CCA TTA ACA 1322 
Gin Asn His Gly His He Glu Lys Asp Glu Ser Trp Val Pro Leu Thr 
145 150 155 

GAG AAG ATT TAC AAG AAT CTA GAC AGC ATG AGA AGA CTC ATT AGA TTC 1370 
Glu Lys He Tyr Lys Asn Leu Asp Ser Met Thr Arg Leu lie Arg Phe 
160 165 170 

ACT GTG CCA TTT CCA TTG TTT GTG TAT CCA ATT TAT TTG TTT TCA AGA 1418 
Thr Val Pro Phe Pro Leu Phe Val Tyr Pro He Tyr Leu Phe Ser Arg 
175 180 185 

AGC CCC GGA AAG GAA GGC TCT CAC TTC AAT CCC TAC AGC AAT CTG TTC 1466 
Ser Pro Gly Lys Glu Gly Ser His Phe Asn Pro Tyr Ser Asn Leu Phe 
190 195 200 



CCA CCC AGT GAG AGA AAA GGA ATA GGA ATA TCA ACA CTG TGT TGG GCT 1S14 
Pro Pro Ser Glu Arg Lys Gly lie Ala He Ser Thr Leu Cys Trp Ala 
205 210 215 220 

ACC ATG TTT TCT CTG CTT ATC TAT CTC TCA TTC ATA ACT AGT CCA CTT 1562 
Thr Met Phe Ser Leu Leu He Tyr Leu Ser Phe He Thr Ser Pro Leu 
225 230 235 

CTA GTG CTC AAG CTC TAT GGA ATT CCA TAT TGG ATA TTT GTT ATG TGG 1610 
Leu Val Leu Lys Leu Tyr Gly lie Pro Tyr Trp He Phe Val Met Trp 
240 245 250 



CTG GAC TTT GTC ACA TAC TTG CAT CAC CAT GGT CAC CAC CAG AAA CTG 1658 
Leu Asp Phe Val Thr Tyr Leu His His His Gly His His Gin Lys Leu 
255 260 265 

CCT TGG TAC CGC GGC AAG GAA TGG AGT TAT TTA AGA GGT GGC CTC ACC 1706 
Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr 
270 275 280 

ACT GTG. GAT CGT GAC TAT GGT TGG ATC TAT AAC ATT CAC CAT GAC ATT 1754 
Thr Val Asp Arg Asp Tyr Gly Trp He Tyr Asn He His His Asp He 
285" 290 / 295 300 

GGC ACC CAT GTT ATC CAC CAT CTT TTC CCC CAA ATT CCT CAT TAT CAC 1802 
Gly Thr His Val lie His His Leu Phe Pro Gin He Pro His Tyr His 
305 310 315 



WO 93/1 1245 



PCT/US92/10284 



14 6 



CTC GTT GAA GCG ACA CAA GCA GCA AAA CCA GTT CTT GGA GAT TAC TAC 1850 
Leu Val GXu Ala Thr Gin Ala Ala Lys Pro Val Leu Gly Asp Tyr Tyr 
320 325 330 

CGT GAG CCA GAA AGA TCT GCG CCA TTA CCA TTT CAT CTA ATA AAG TAT 1898 
Arg Glu Pro Glu Arg Ser Ala Pro Leu Pro Phe His Leu He Lys Tyr 
335 340 345 

TTA ATT CAG AGT ATG AGA CAA GAC CAC TTC GTA AGT GAC ACT GGA GAT 1946 
Leu He Gin Ser Met Arg Gin Asp His Phe Val Ser Asp Thr Gly Asp 
350 355 360 

GTT GTT TAT TAT CAG ACT GAT TCT CTG CTC CTC CAC TCG CAA CGA GAC 1994 
Val Val Ty* Tyr Gin Thr Asp Ser Leu Leu Leu His Ser Gin Arg Asp 
365 370 375 380 

TGAGTTTCAA ACTTTTTGGG TTATTATTTA TTGGATTCTA GCTACTCAAA TTACTTTTTT 2054 

TTTAATGTTA TGTTTTTTGG AGTTTAACGT TTTCTGAACA ACTTGGAAAT TACTTGCATA 2114 

GAGAGACATG GAATATTTAT TTGAAATTAG TAAGGTAGTA ATAATAAATT TTGAATTGTC 2174 

AGTTTCA 2181 



(2) INFORMATION FOR SEQ ID NO: 11; 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met VaX Lys Asp Thr Lys Pro Leu Ala Tyr Ala Ala Asn Asn Gly Tyr 
1 5 10 15 

Gin Gin Lys Gly Ser Ser Phe Asp Phe Asp Pro Ser Ala Pro Pro Pro 
20 25 30 

Phe Lys He Ala Glu He Arg Ala Ser He Pro Lys His Cys Trp Val 
35 40 45 

Lys Asn Pro Trp Arg Ser Leu Ser Tyr Val Leu Arg Asp Val Leu Val 
50 55 60 

lie Ala Ala Leu Val Ala Ala Ala He His Phe Asp Asn Trp Leu Leu 
65 70 * 75 80 

Trp Leu lie Tyr Cys Pro He Gin Gly Thr Met Phe Trp Ala Leu Phe 
85 90 95 

Val Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asp Ser Pro Leu 
100 105 HO 
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Leu Asn Ser Leu Val Gly His lie Leu Kis Ser Ser lie Leu Val Pro 
115 120 125 

Tyr His Gly Trp Arg lie Ser His Arg Thr His His Gin Asn His Gly 
130 135 140 

His lie Glu Lys Asp Glu Ser Trp Val Pro Leu Thr Giu Lys lie Tyr 
145 150 155 160 

Lys Asn Leu Asp Ser Met Thr Arg Leu He Arg Phe Thr Val Pro Phe 
165 170 175 

Pro Leu Phe Val Tyr Pro lie Tyr Leu Phe Ser Arg Ser Pro Gly Lys 
180 185 190 

Glu Gly Ser His Phe Asn Pro Tyr Ser Asn Leu Phe Pro Pro Ser Glu 
195 200 205 

Arg Lys Gly He Ala He Ser Thr Leu Cys Trp Ala Thr Met Phe Ser 
210 215 220 

Leu Leu He Tyr Leu Ser Phe He Thr Ser Pro Leu Leu Val Leu Lys 
225 230 235 240 

Leu Tyr Gly He Pro Tyr Trp He Phe Val Met Trp Leu Asp Phe Val 
245 250 255 

Thr Tyr Leu His His His Gly His His Gin Lys Leu Pro Trp Tyr Arg 
260 265 270 

Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Val Asp Arg 
275 280 285 

Asp Tyr Gly Trp He Tyr Asn He His His Asp lie Gly Thr His Val 
290 295 300 

He His His Leu Phe Pro Gin He Pro His Tyr His Leu Val Glu Ala 
305 310 315 320 

Thr Gin Ala Ala Lys Pro Val Leu Gly Asp Tyr Tyr Arg Glu Pro Glu 
325 330 335 

Arg Ser Ala Pro Leu Pro Phe His Leu He Lys Tyr Leu He Gin Ser 
340 345 350 

Met Arg Gin Asp His Phe Val Ser Asp Thr Gly Asp Val Val Tyr Tyr 
355 360 365 

Gin Thr. Asp Ser Leu Leu Leu His Ser Gin Arg Asp 
370 375 380 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1675 base pairs 

(B) TYPE: nucleic acid 
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(C J STRANDEDNES S : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Glycine max 
(vii), IMMEDIATE SOURCE: 

(B) CLONE: pSFD-118bwp 
(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 169.. 1530 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 12 r 

CTGTGGCAAT TTTTCTCTTC TCCTTCTGGT TCTGATCTTT GTGTTCTTCT TTGTTTCTCA €0 

CCTTTCTGAG GATTTTTCCA TCTTAGTTCC TGGAGGCACC AGGAACCTGA CCAAATAAAT 120 

AAACCTTTTT TTTCTTCTAA TTTTTCTGAA GTTTCATTTT TTAGTCCA ATG GCA ACT 177 

Mot Aid Tnr 

i 

TGG TAT CAT CAG AAA TGT GGC TIG AAG CGT CTT GCT CCA GTA ATT CCT 225 
Tro Tyr His Gin Lys Cys Gly Leu Lys Pro Leu Ala Pro Val lie Pro 
' 5 10 15 

AGA CCT AGA ACT GGG GCT GCT TTG TCC AGC ACC TCA AGG GTT GAA TTT 
Arg Pro Arg Thr Gly Ala Ala Leu Ser Ser Thr Ser Arg Val Glu Phe 
20 25 30 35 

TTG GAC ACA AAG AAG GTA GTG GCA GGT CCT AAG TTT CAA CCT TTG AGG 
Leu Asp thr Asn Lys Val Val Ala Gly Pro Lys Phe Gin Pro Leu Arg 
40 45 50 

TGC AAC CTC AGG GAG AGG AAT TGG GGG GTG AAA GTG AGT GCC CCT TTG 
Cys Asn Leu Arg Glu Arg Asn Trp Gly Leu Lys Val Ser Ala Pro Leu 
55 60 65 

AGS GTT GCT TCC ATT GAA GAG GAG CAA AAG AGT GTT GAT TTA ACC AAT 
Arg Val Ala Ser He Glu Glu Glu Gin Lys Ser Val Asp Leu Thr Asn 
- 70 75 80 

GGG ACT AAT GGG GTT GAG CAT GAG AAG CTT CCA GAA TTT GAC CCT GGT 
Gly Thr Asn Gly Val Glu His Glu Lys Leu Pro Glu Phe Asp Pro Gly 
85 90 95 

GCT CCG CCA CCA TTC AAC TTG GCT GAT ATT AGA GCA GCC ATT CCA AAG 
Ala Pro Pro Pro Phe Asn Leu Ala Asp He Arg Ala Ala lie Pro Lys 
100 105 "-0 US 



273 



321 



369 



41? 



465 



513 
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CCT TGG TAC CGT GGA AAG GAA TGG AGC TAC CTC AGG GGT GGT CTA ACT 
Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr 
340 345 350 355 

ACT CTT GAT CGT GAT TAT GGA TGG ATC AAT AAC JiTZ CAC GAT GAC ATT 
Thr Leu Asp Arg Asp Tyr Gly Trp He Asn Asn lie His His Asp He 
360 355 370 

GGC ACT CAT GTC ATT CAT CAC CTA TTT CCT CAA ATT CCA CAC TAT CAC 
Gly Thr His Val lie His His Leu Phe Pro Gin lie Pro His Tyr His 
375 380 385 

TTA CTT GAG GCT ACT GAG GCT GCT AAG CCA GTG TTT GGA AAA TAT TAT 
Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Phe Gly Lys Tyr Tyr 
390 395 400 

AGA GAA CCA AAG AAA TCA GCA GCA CCT CTT CCT TTT CAC CTT ATT GGG 
Arg Glu Pro Lys Lys Ser Ala Ala Pro Leu Pro Phe His Leu He Gly 
405 410 415 

GAA ATA ATA AGG AGC TTC AAG ACT GAC CAT TTT GTT AGT GAC ACG GGG 
Glu lie lie Arg Ser Phe Lys Thr Asp His Phe Val Ser Asp Thr Gly 
420 425 430 435 

GAT GTT GTG TAC TAT CAA ACC GAC TCT AAG ATT AAT GGC TCT TCC AAA 
Asp Val Val Tyr Tyr Gin Thr Asp Ser Lys He Asn Gly Ser Ser Lys 
440 445 450 

TTA GAG TGAATATTAA AATTCTTTTC TATATAGACA AGAGAGGCTT ATACACAATT 
Leu Glu 

CTTATTGCTT TAAAGATTGT CTTGAGTTTC TCCGAAAGTT ACTGCACTTA CTTGGAGTTG 
AATCCTTCAT TAATAAAGGG ATGGATGGAT CATATAAA 



1233 
1281 
1329 
1377 
1425 
1473 
1521 

1577 

1637 
1675 / 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 

(B) TXPE; - amino acid 
(D) TOPOLOGT : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION r SEQ ID NO: 13: 

Met Ala Thr Trp Tyr His Gin Lys Cys Gly Leu Lys Pro Leu Ala Pro 
1 5 10 15 

Val He Pro Arg Pro Arg Thr Gly Ala Ala Leu Ser Ser Thr Ser Arg 
20 25 30 

Val Glu Phe Leu Asp Thr Asn Lys Val Val Ala Gly Pro Lys Phe Gin 
35 40 45 
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Pro I*eu Arg Cys Asn Leu Arg Glu Arg Asn Trp Glv Leu Lys Val Ser 
50 55 60 

Ala Pro Leu Arg Val Ala Ser lie Glu Glu Glu Gin Lys Ser Val Asp 
65 70 75 80 

Leu Thr Asn Gly Thr Asn Gly Val Glu His Glu Lys Leu Pro Glu Phe 
85 90 95 

Asp Pro Gly Ala Pro Pro Pro Phe Asn Leu Ala Asp He Arg Ala Ala 
100 105 110 

He Pro Lys His Cys Trp Val Lys Asp Pro Trp Arg Ser Met Ser Tyr 
115 120 125 

Val Val Arg Asp Val He Ala Val Phe Gly Leu Ala Ala Ala Ala Ala 
130 135 140 

Tyr Leu Asn Asn Trp Leu Val Trp Pro Leu Tyr Trp Ala Ala Gin Gly 
145 150 155 160 

j 

Thr Met Phe Trp Ala Leu Phe Val Leu Gly His Asp Cys Gly His Gly 
165 170 175 

Ser Phe Ser Asn Asn Ser Lys Leu Asn Ser Val Val Gly His Leu Leu 
180 185 190 

His Ser Ser He Leu Val Pro Tyr His Gly Trp Arg He Ser His Arg 
195 200 205 

Thr His His Gin His His Gly His Ala Glu Asn fisp Glu Ser Trp His 
210 215 220 

Pro Leu Pro Glu Lys Leu Phe Arg Ser Leu Asp Thr val Thr Arg Met 
225 230 235 240 

Leu Arg Phe Thr Ala Pro Phe Pro Leu Leu Ala Phe Pro Val Tyr Leu 
245 250 255 

Phe Ser Arg Ser Pro Gly Lys Thr Gly Ser His Phe Asp Pro Ser Ser 
260 265 270 

Asp Leu Phe Val Pro Asn Glu Arg Lys Asp Val lie Thr Ser Thr Ala 
275 280 285 

Cys Trp Ala Ala Met Leu Gly Leu Leu Val Gly Leu Gly Phe Val Met 
290 295 300 

Gly Pro He Gin Leu Leu Lys Leu Tyr Gly Val Pro Tyr Val He Phe 
305 310 m 315 320 

Val Met Trp Leu Asp Leu Val Thr Tyr Leu His His His Gly His Glu 
325 330 335 

Asp Lys Leu Pro Trp Tyr Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly 
340 345 350 
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Gly Leu thr Thr Leu Asp Arg Asp Tyr Gly Trp lie Asn Asn He His 
355 360 365 

His Asp He Gly Thr His Val He His His Leu Phe Pro Gin He Pro 
370 375 380 

His Tyr His Leu Val Glu Ala Thr Glu Ala Ala Lys Pro Val Phe Gly 
385 390 395 400 

Lys Tyr Tyr Arg Glu Pro Lys Lys Ser Ala Ala Pro Leu Pro Phe His 
405 410 415 

Leu He Gly Glu He He Arg Ser Phe Lys Thr Asp His Phe Val Ser 
420 425 430 

Asp Thr Gly Asp Val Val Tyr Tyr Gin Thr Asp Ser Lys He Asn Gly 
435 440 445 

Ser Sex Lys Leu Glu 
450 

(2) INFORMATION FOR SEQ ID NO: 14: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 396 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: pPCR20 
(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 31.. 363 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 14: 1 

GGATCCACGC ATCATCAGAA TCACGGTCAC ATCCACAGGG ACGAGTCATG GCACCCGATC 60 

ACGGAGAAGC TGTACCGGCA ACTAGAGCCA CGCACCAAGA AGCTGAGATT CACGGTGCCC 120 

TTCCCCCTGC TCGCATTCCC CGTCTACCTC TTGTACAGGA GCCCCGGCAA GCTCGGCTCC 180 

CACTTCCTTC CCAGCAGCGA CCTGTTCAGC CCCAAGGAGA AGAGCGACGT CATGGTGTCA 240 
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ACCACCTGGT GGTGCATCAT GCTCGCCTCC CTCCTCGCCA TGGCGTGCGC GTTCGGCCCA 300 
CTCCAGGTGC TCAAGATGTA CGGCATCCCA TACCTGGTGT TCGTGATGTG GCTTGACCTG 360 
GTGACGTACT TACATCACCA CGGCCACGAT GGATCC 396 

(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS : 

(A). LENGTH: 126 amino acids 

<B) TYPE: amino a ci 4 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iil) HYPOTHETICAL : YES 

(v) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 
(vii> IMMEDIATE SOURCE: 

(B) CLONE: pPCR20 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 15: 

His His Gin Asn His Gly His lie His Arg Asp Glu Ser Trp His Pro 
1 5 10 15 

lie Thr Glu Lys Leu Tyr Arg Gin Leu Glu Pro Arg Thr Lys Lys Leu 
20 25 30 

Arg Phe Thr Val Pro Phe Pro Leu Leu Ala Phe Pro Val Tyr Leu Leu 
35 40 45 

Tyr Arg Ser Pro Gly Lys Leu Gly Ser His Phe Leu Pro Ser Ser Asp 
50 55 60 

Leu Phe Ser Pro Lys Glu Lys Ser Asp Val; Met val Ser Thr Thr cys 
65 70 75 80 

Trp Cys lie Met Leu Ala Ser Leu Leu Ala Met Ala Cys Ala Phe Gly 
85 90 95 

Pro Leu Gin Val Leu Lys Met Tyr Gly He Pro Tyr Leu Val Phe Val 
100 105 110 

Met Trp Leu Asp Leu Val Thr Tyr Leu His His His Gly His 
115 120 125 
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Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CGGGATCCAC NCAYCAYCAR AAYCAYGGNC A 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /note- "N» INOSINE" 
(ix) FEATURE: 

(A) NAME /KEY: misc feature 
•(B) LOCATION: 16.. 35 

(D) OTHER INFORMATION: /note- W N~ A OR T OR G OR 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19r 
CGGGATCCRT CRTGNCCRTG RTGRTGNARR TANGT 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..36 

(D) OTHER INFORMATION: /note= tt N= INOSINE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTCGTNNTNG GNCAYGAYTG YGGNCAYGGN CAYGGNAGNT TC 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: miac feature 

(B) LOCATION: 1..36 

(D) OTHER INFORMATION: /note= "N- INOSINE' 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TTCGTNNTNG GHCAYGAYTG YGGNCAYGGN TCNTTC 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS^ 

(A) LENGTH: 18 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GGHCAYGAYT GYGGHCAC 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GGHCAYGAYT GYGGHCAT 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



GTACTRTARC CDTGDGTR 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GTGCTRTARC CDTGDGTR 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) . STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTRCANTARG TRGTRAAYAA YGG 



(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 23 base pairs 

(B) . TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY:- linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GTRCANTADG TRGTRGADAA YGG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..36 

(D) OTHER INFORMATION: /note= "N- INOSINE" 



38 
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(D) OTHER INFORMATION: /note- rt N~ INOSINE" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTRCTRTANC CNTGNGTNCA NTANGTGGTG RANAAGGG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 base pairs 

<B) TYPE: nucleic acid 

(C) STOANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1.-135 

(D) OTHER INFORMATION: /note« W N« INOSINE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GTGGTGNGTN CNGTGNGANA NNCKCCANCC GTGGTANGGN ACNANNANGA ANGANGAGTG 60 
NANNANGTGN CCNACNANNG AGTTNANNAN NGGNATNTCN GAGAANGANC CGTGNCCGCA 120 

138 

NTCGTGNCCN ANNACGAA 
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CLAIMS 

1. An isolated nucleic acid fragment comprising a 
nucleic acid sequence encoding a fatty acid desaturase 
or a fatty acid desaturase-related enzyme with an amino 
5 acid identity of 50% or greater to the polypeptide 
encoded by SEQ ID N0S:1, 4, 6, 8, 10, 12, 14 or 16. 

2- The isolated nucleic acid fragment of Claim 1 
wherein the amino acid identity is 65% or greater to the 
polypeptide encoded- by SEQ ID N0S:1, 4, 6, 8, 10, 12, 14 
10 or 16, 

3. The isolated nucleic acid fragment of Claim 1 
wherein the nucleic acid identity is 90% or greater to 
SEQ ID NQS:1, 4, 6, 8, 10, 12, 14 or 16. 

4. An isolated nucleic acid fragment of Claim 1 
15 wherein said fragment is isolated from a plant selected 

from the group consisting of soybean, oilseed Brassica 
species, Arahidopsis thaliana and corn. 

5. A chimeric gene capable of causing altered 
levels of linolenic acid in a transformed plant cell, 

20 the gene comprising a nucleic acid fragment of any of 
Claims 1, 2, or 3, the fragment operably linked to 
suitable regulatory sequences. 

6. Plants containing the chimeric genes of 
Claim 5 . 

25 7. Oil obtained from seeds of the plants 

containing the chimeric genes of Claim 5. 

8. A method of producing seed oil containing 
altered levels of linolenic (18:3) acid comprising: 

(a) transforming a plant cell of an oil- 
30 producing species with a chimeric gene of Claim 5; 

(b) growing fertile plants from the 
transformed plant cells of step (a); 

(c) screening progeny seeds from the fertile 
plants of step (b) for the desired levels of linolenic 

35 (18:3) acid; and 
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(d) processing the progeny seed of step (c) 
to obtain seed oil containing altered levels of 
linolenic (18:3) acid. 

9 The product of the method of Claim 8, 
5 10. A method of Claim 8 wherein said plant cell of 

an oil-producing species is selected from the group 
consisting of frnrivMrmsis ttmliana, soybean, oilseed 
Rrassica species, sunflower, cotton, cocoa, peanut, 
saf flower, and corn. 
10 ^11- A method of breeding plant species producing 

altered levels of linolenic acid in the seed oil of oil- 
producing plant species comprising: 

(a) making a cross between two varieties of 
oil-producing species differing in the linolenic acid 

15 trait; 

(b) making a Southern blot of restriction 
enzyme digested genomic DNA isolated from several 
progeny plants resulting from the cross of step (a) ; and 

(c) hybridizing the Southern blot with a 
20 radiolabeled nucleic acid fragment of Claim 1. 

12. The product of the method of Claim 11. 

13. A method of RFLP mapping in a genomic RFLP 

marker comprising: 

(a) making a cross between two varieties of 

25 plants; 

(b) making a Southern blot of restriction 
enzyme digested genomic DNA isolated from several 
progeny plants resulting from the cross of step (a) ; and 

(c) hybridizing the Southern blot with a 
30 radiolabeled nucleic acid fragments of Claim 1. 

14. A method to isblate nucleic acid fragments 
encoding fatty acid desaturases and fatty acid 
desaturase-related enzymes, comprising: 



WO 93/1 1245 



153 



PCT/US92/ 10284 



(a) comparing SEQ ID N0S:2, 5, 7, 9, 11, 13, 
15 and : 17 with other fatty acid desaturase polypeptide 
sequences; 

(b) identifying the conserved sequence (s) of 
5 4 or more amino acids obtained in step (a); 

(c) making region-specific nucleotide 

probe (s) or oligomer (s) based on the conserved sequences 
identified in step b; and 

(d) using the nucleotide probe (s) or 

10 oligomers (s) of step c to isolate sequences encoding 

fatty acid desaturase s and fatty-acid desaturase-related 
enzymes by sequence-dependent protocols, 

15. The product of the method of Claim 14. 

16. The isolated genomic DNA of Arabidopsis 
15 t ha liana identified by accession number ATCC 75167. 

17 ♦ An isolated cDNA clone which encodes for 
soybean deita-15 desaturase, the clone designated pXFl 
comprising the DNA sequence of SEQ ID NO 10 and 
identified by accession number ATCC 68874. 
20 IB. An isolated cDNA clone which encodes for 

oilseed Brassier species delta-15 desaturase, the clone 
designated pBNSF3 comprising the DNA sequence of SEQ ID 
NO: 6 and identified by accession number ATCC 68854. 

19. An isolated Polymerase Chain Reaction Product 
25 for Zea maxs. delta-15 desaturase, the clone designated 
pcr20 comprising the DNA sequence of SEQ ID NO: 14. 
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